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Glvcosvlphosphatidvlinositol Specific Phospholipase D 
Proteins and Uses Thereof 

Field of the Invention 
5 The present invention relates to 

glycosylphosphatidylinositol specific phospholipase D 
(GPI-PLD) proteins and uses of these proteins, in 
particular in the treatment of liver dysfunction. 

10 Background of the Invention 

Studies have shown that a number of cell surface proteins 
are attached to the cell membrane by covalent linkage to 
a glycosylphosphatidylinositol (GPI) anchor. It has been 
shown that the enzyme GPI-PLD cleaves the photodiester 

15 bond linking glycosylphosphatidylinositol to phosphatidic 

acid, thereby releasing anchored proteins* 

GPI-PLD enzymes are abundantly present in human and 
bovine serum (5-10pg/ml in human serum) . US Patent No: 

20 5 , 418,147 (Huang et al) describes the purification of 

GPI-PLD from bovine liver, and the subsequent cloning of 
three GPI-PLD enzymes from bovine liver, human liver and 
human pancreas cDNA libraries. This patent reports the 
full length cDNA and amino acid sequences of the GPI-PLDs 

25 from human and bovine liver, and the partial cDNA and 

amino acid sequences of the human pancreatic form of the 
enzyme. Subsequently , the full length sequence of the 
pancreatic form of GPI-PLD was reported in T sang et al 
(19 92) , and this enzyme has been found in cDNA libraries 

30 from breast, eye, spleen and tonsil. The three forms of 

the enzymes are highly homologous with the predicted 
mature protein sequences of bovine liver GPI-PLD sharing 
82% sequence identity with the human liver enzyme and 81% 
sequence identity with the human pancreatic enzyme. The 

35 amino acid sequences of human liver and pancreatic forms 

of GPI-PLD were deposited at GenBank under accession 
numbers L11701 and L11702 and consist of 841 and 840 
amino acids respectively. The human liver and pancreatic 
forms of GPI-PLD share 94.6% sequence identity. The 



structure of GPI-PLDs is further discussed in Scallon et 
al, 1991. 

However, despite cloning three forms of GPI-PLD, there is 
no suggestion in these references as to the in vivo role 
of the enzymes . Further, the only application of the 
enzymes suggested is in an expression system in which a 
heterologous protein is expressed in a host cell as a 
fusion with a GPI- signal peptide, leading to the 
heterologous protein becoming anchored to the cell 
membrane by a GPI anchor, where it can be cleaved off by 
coexpressed or added GPI - PUD . 

GPI-PLD has also been isolated from human serum by Hoener 
et al (1992) and this form of the enzyme was found to be 
identical to the human pancreatic GPI-PLD apart from 
changes at 531 to 534 where VIGS is replaced by MLGT. 
This paper also showed that treatment of serum GPI-PLD 
with N-glycosidase F reduced the apparent molecular 
weight from 123 kD to 87 kD. Similarly, by Li et al 

(1994) showed GPI-PLD was cleaved by trypsin into 3 
fragments (2 x 40 kD and 30 kD) , and by Heller et al 

(1994) which showed that 33, 39 and 47kD species were 
produced, with only the N- terminal 39 kD fragment moiety 

showing enzyme activity after renaturation. 

It has been proposed that one function of GPI-PLD enzyme 
is to produce inositolphosphoglycans (IPGs) by the 
cleavage of "free" GPls in the plasma membrane in 
response to binding of a growth factor to its receptor 
(Rademacher et al, 1994). This role for GPI-PLD has been 
demonstrated in mast cells where IgE- dependent activation 
of these cells results in release of their granule 
contents, which include substances such as histamine, a 
mediator of the inflammatory response. In the presence 
of antigen, histamine is released; this release can be 
mimicked by addition of IPGs and is blocked by addition 



of anti-GPI-PLD antibodies (Lin et al, 1991) . 

The role of GPI-PLD in cleaving GPI- anchored proteins, 
and especially inositolphosphoglycans (IPGs) , is examined 
in Jones et al (1997) . However, the authors reflect the 
uncertainty in the art regarding the mechanism of IPG 
generation, noting that "The definitive activated enzyme, 
being a GPI -PLC or a GPI-PLD, has yet to be unequivocally 
identified" and that "little attention has been payed to 
the role of GPI-PLD as the hydrolysing enzyme". 

Deeg et al (1994) employed fractionation to look at the 
association of GPI-PLD with high- density lipoproteins 
(HDL) in human plasma and found that most of the GPI-PLD 
in human plasma was associated with apolipoprotein Al 
(Al) . However, the authors conclude that "the 
significance of the GPI-PLD association with Al is 
unknown". Similar results were reported in Hoener et al 
(199 3) , which notes that the complex between the two 
species was virtually inactive, although it was capable 
of hydrolyzing solubilised GPI -anchored substrate 
efficiently. 

In summary, despite the cloning of GPI-PLD enzymes and 
investigation as to their biochemical properties, the 
role of the enzyme in vivo or any possible medical use 
remains unknown. 

Summary of the Invention 

Broadly, the present invention relates to GPI-PLD for 
medical use, and in particular to the use of GPI-PLD in 
the treatment of liver dysfunction, optionally in 
combination with apolipoprotein Al. 

Accordingly, in a first aspect, the present invention 
provides the use of GPI-PLD for the preparation of a 
medicament for the treatment of liver dysfunction. 



Preferably, the GPI-PLD is administered in combination 
with apolipoprotein Al. 

In a further aspect, the present invention provides a 
method of treating a patient having liver dysfunction, 
the method comprising administering to the patient a 
therapeutically effective amount of GPI-PLD. 

In all of the above aspects, GPI-PLD can be administered 
alone or in conjunction with other treatments for liver 
dysfunction, either sequentially or simultaneously. 

In a further aspect, the present invention provides a kit 
comprising a composition including GPI-PLD, and 
optionally apolipoprotein Al, and a second composition 
for the treatment of liver dysfunction. 

in a further aspect, the present invention provides a 
pharmaceutical composition comprising a nucleic acid 
molecule encoding a GPI-PLD protein and apolipoprotein 
Al. 

These and other aspects of the present invention are 
described in more detail below. 

By way of example, embodiments of the present invention 
will now be described in more detail with reference to 
the accompanying figures. 

Brief Description of the Figures 

Figure 1 shows an alignment of the deduced amino acid 
sequences of GPI-PLD encoded by cDNA clone Al and the 
bovine and human liver GPI-PLD sequences disclosed in US 
Patent No: 5,418,147 (Huang et al) . 

Figure 2 shows the nucleic acid sequence from cDNA clone 
Al aligned with the pancreatic forms of GPI-PLD disclosed 



in US Patent No: 5,418,147 (Huang et al) (partial 
sequence) and the corresponding full length nucleic acid 
sequence deposited at GenBank. 

Figure 3 shows the amino acid sequences of the GPI-PLDs 
in clones al, b2 and d3 , and consist of 840, 795 and 510 
amino acids respectively. 

Figure 4 shows the nucleic acid sequence of cDNA clone al 
encoding GPI-PLD, consisting of 2832 bp. 

Figure 5 shows the nucleic acid sequence of cDNA clone b2 
encoding GPI-PLD, consisting of 2472 bp. 

Figure 6 shows the nucleic acid sequence of cDNA clone d3 
encoding GPI-PLD, consisting of 1942 bp. 

Figure 7 shows an alignment of the deduced amino acid 
sequences of GPI-PLDs encoded by cDNA clones al, b2 and 
d3 with the pancreatic form of the enzyme deposited at 
GenBank under accession number 11702. 

Figure 8 shows an alignment of the nucleic acid sequences 
from cDNA clones al, b2 and d3 with the cDNA sequence 
encoding the human pancreatic form of GPI-PLD deposited 
at GenBank under accession number 11702. 

Detailed Description 
GPI-PLD Proteins 

The term "GPI-PLD biological activity" is herein defined 
as the enzymatic activity of GPI-PLD in cleaving the 
photodiester bond linking glycosylphosphatidylinositol to 
phosphatidic acid, e.g. releasing a GPI- anchored protein. 
As noted in Heller et al (1994) , this activity has been 
localised to the N- terminal 39 kD portion of full length 
GPI-PLD. 



The medical uses of GPI-PLD described herein can use the 
novel GPI-PLD variants or the forms of the enzyme 
disclosed in the prior art. In either event, the skilled 
person can use the techniques described herein and others 
well known in the art to produce large amounts of these 
proteins, or fragments or active portions thereof, for 
use as pharmaceuticals, in the developments of drugs and 
for further study into its properties and role in vivo. 

In a further aspect of the present invention provides a 
polypeptide having the amino acid sequence shown in 
figure 3, which may be in isolated and/or purified form, 
free or substantially free of material with which it is 
naturally associated- In one embodiment, the clone al 
has an amino acid sequence consisting of 840 amino acids, 
a 23 amino acid signal peptide and a 817 amino acid 
mature protein. 

GPI-PLD proteins which are amino acid sequence variants, 
alleles or derivatives can also be used in the present 
invention. A polypeptide which is a variant, allele or 
derivative may have an amino acid sequence which differs 
from that given in figures 1 or 3 by one or more of 
addition, substitution, deletion and insertion of one or 
more amino acids. Preferred polypeptides have GPI-PLD 
enzymatic function as defined above. 

A GPI-PLD protein which is an amino acid sequence 
variant, allele or derivative of an amino acid sequence 
shown in figures 1 or 3 may comprise an amino acid 
sequence which shares greater than about 70%, greater 
than about 80%, greater than about 9 0%, greater than 
about 95%, greater than about 97%, greater than about 9 8% 
or greater than about 99% sequence identity with an amino 
acid sequence shown in figures 1 or 3. Sequence 
comparison and identity calculations were carried out 
using the Cluster program (Thompson et al, 1994) , using 



the following parameters (Pairwise Alignment Parameters: 
Weight Matrix: pam series; Gap Open Penalty: 10.00; Gap 
Extension Penalty: 0.10). Alternatively, the GCG program 
could be used which is available from Genetics Computer 
Group, Oxford Molecular Group, Madison, Wisconsin, USA, 
Version 9.1. Particular amino acid sequence variants may 
differ from those shown in figures 1 and 3 by insertion, 
addition, substitution or deletion of 1 amino acid, 2, 3, 
4, 5-10, 10-20 20-30, 30-50, 50-100, 100-150, or more 
than 150 amino acids. 

The present invention also includes the use of active 
portions, fragments and derivatives of the GPI-PLD 
proteins . 

An "active portion" of GPI-PLD protein is a polypeptide 
which is less than said full length GPI-PLD protein, but 
which retains at least one its essential biological 
activity, e.g. the enzyme activity mentioned above. For 
instance, portions of GPI-PLD protein can act as 
sequestrators or competitive antagonists by interacting 
with other proteins. 

A "fragment" of the GPI-PLD protein means a stretch of 
amino acid residues of at least about 5 to 7 contiguous 
amino acids, often at least about 7 to 9 contiguous amino 
acids, typically at least about 9 to 13 contiguous amino 
acids and, most preferably, at least about 20 to 30 or 
more contiguous amino acids . 

A "derivative" of the GPI-PLD protein, or a fragment 
thereof, means a polypeptide modified by varying the 
amino acid sequence of the GPI-PLD protein, e.g. by 
manipulation of the nucleic acid encoding the protein or 
by altering the protein itself. Such derivatives of the 
natural amino acid sequence may involve insertion, 
addition, deletion or substitution of one, two, three. 
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five or more amino acids, without fundamentally altering 
a biological activity of the wild type GPI-PLD protein. 

A polypeptide according to the present invention may be 
isolated and/or purified (e.g. using an antibody) for 
instance after production by expression from encoding 
nucleic acid (for which see below) . Polypeptides 
according to the present invention may also be generated 
wholly or partly by chemical synthesis. The isolated 
and/or purified polypeptide may be used in formulation of 
a composition, which may include at least one additional 
component, for example a pharmaceutical composition 
including a pharmaceutically acceptable excipient, 
vehicle or carrier. A composition including a 
polypeptide according to the invention may be used in 
prophylactic and/or therapeutic treatment as discussed 
below. 

The GPI-PLD polypeptides can also be linked to a coupling 
partner, e.g. an effector molecule, a label, a drug, a 
toxin and/or a carrier or transport molecule. Techniques 
for coupling the peptides of the invention to both 
peptidyl and non-peptidyl coupling partners are well 
known in the art. In one embodiment, the carrier 
molecule is a 16 aa peptide sequence derived from the 
homeodomain of Antennapedia (e.g. as sold under the name 
"Penetratin" ) , which can be coupled to a peptide via a 
terminal Cys residue. The "Penetratin" molecule and its 
properties are described in W09 1/18981. 

Pharmaceutical Compositions 

As mentioned above, GPI-PLD proteins can used for 
treating liver dysfunction, optionally in conjunction 
with other treatments for these disorders. Preferably, 
the GPI-PLD is administered with apolipoprotein Al, and 
more preferably, as a complex with this substance. The 
isolation of apolipoprotein Al is described in Hoener et 
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al (1993) , Deeg et al (1994) and Brewer et al (1986) . 
The compositions can be used to treat liver dysfunction 
conditions which are characterised by reduced levels of 
apolipoprotein Al and/or GPI-PLD and/or apolipoprotein 
5 Al /GPI-PLD complex. 

Thus, the GPI-PLD protein and/or apoliprotein Al can be 
formulated in pharmaceutical compositions, which may 
comprise, in addition to one of the above substances, a 

10 pharmaceutical ly acceptable excipient, carrier, buffer, 

stabiliser or other materials well known to those skilled 
in the art. Such materials should be non- toxic and 
should not interfere with the efficacy of the active 
ingredient. The precise nature of the carrier or other 

15 material may depend on the route of administration, e.g. 

oral, intravenous, cutaneous or subcutaneous, nasal, 
intramuscular, intraperitoneal routes . 

Pharmaceutical compositions for oral administration may 
20 be in tablet, capsule, powder or liquid form. A tablet 

may include a solid carrier such as gelatin or an 
adjuvant. Liquid pharmaceutical compositions generally 
include a liquid carrier such as water, petroleum, animal 
or vegetable oils, mineral oil or synthetic oil. 
25 Physiological saline solution, dextrose or other 

saccharide solution or glycols such as ethylene glycol, 
propylene glycol or polyethylene glycol may be included. 

For intravenous, cutaneous or subcutaneous injection, or 
30 injection at the site of affliction, the active 

ingredient will be in the form of a parenterally 
acceptable aqueous solution which is pyrogen -free and has 
suitable pH, isotonicity and stability. Those of 
relevant skill in the art are well able to prepare 
35 suitable solutions using, for example, isotonic vehicles 

such as sodium chloride injection, Ringer's injection, 
lactated Ringer's injection. Preservatives, stabilisers, 



10 

buffers, antioxidants and/or other additives may be 
included as required. 

Whether it is a polypeptide, peptide, nucleic acid 
5 molecule, small molecule or other pharmaceutically useful 
compound of the invention that is to be given to an 
individual, administration is preferably in a 
"prophylactically effective amount" or a "therapeutically 
effective amount" (as the case may be, although 

10 prophylaxis may be considered therapy) , this being 

sufficient to show benefit to the individual. The actual 
amount administered, and rate and time-course of 
administration, will depend on the nature and severity of 
what is being treated. Prescription of treatment, e.g. 

15 decisions on dosage etc, is within the responsibility of 

general practitioners and other medical doctors, and 
typically takes account of the disorder to be treated, 
the condition of the individual patient, the site of 
delivery, the method of administration and other factors 

20 known to practitioners. Examples of the techniques and 

protocols mentioned above can be found in Remington's 
Pharmaceutical Sciences, 16th edition, Oslo, A. (ed) , 
1980. 

25 GPI-PLD proteins can be administered alone or in 

combination with other treatments for liver dysfunction, 
either simultaneously or sequentially. 



GPI-PLD nucleic acid 

"GPI-PLD nucleic acid" includes a nucleic acid molecule 
which has a nucleotide sequence encoding a polypeptide 
which includes the amino acid sequence shown in figures 4 
to 6, and in some embodiments of the invention extends to 
the known human liver and pancreatic forms of GPI-PLD 
(L11701 and L11702) . These forms of GPI-PLD have been 
mapped to human chromosome 6 and are contained in the 4 
centimorgan region of D6S1660-D6S1558 at positions 95.95 
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and 99.71 (NCBI GeneMap'98). This corresponds to the 
cytogenetic region of 6p21.3. This region also contains 
the IDDM1 and HLA loci (although the HLA genes map to the 
adjacent D6S1558 -D6S1616 interval). The mouse GPI-PLD 
5 gene has also been mapped to chromosome 13, near the fim 

1 locus, which is found in humans on chromosome 6. 



The GPI-PLD coding sequence may be that shown in figures 
2, 4 to 6 or 8, a complementary nucleic acid sequence, or 

10 it may be a mutant, variant, derivative or allele of 

these sequences. The sequence may differ from that shown 
by a change which is one or more of addition, insertion, 
deletion and substitution of one or more nucleotides of 
the sequence shown. Changes to a nucleotide sequence may 

15 result in an amino acid change at the protein level, or 

not, as determined by the genetic code. 

The encoded polypeptide may comprise an amino acid 
sequence which differs by one or more amino acid residues 

20 from the amino acid sequence shown in the figures. 

Nucleic acid encoding a polypeptide which is an amino 
acid sequence mutant, variant, derivative or allele of 
the sequence shown in figures 1, 3 or 7 is further 
provided by the present invention. Such polypeptides are 

25 discussed below. Nucleic acid encoding such a 

polypeptide may show greater than about 70% identity, 
greater than about 80% identity, greater than about 90% 
identity, greater than about 95% identity, greater than 
about 98% identity, or greater than about 99% identity 

30 with a sequence shown in the figures. 

The present invention also includes fragments of the GPI- 
PLD nucleic acid sequences described herein, the 
fragments preferably being at least 12, 15, 30, 45, 60, 
35 or 120 nucleotides in length. 



Generally, nucleic acid according to the present 



10 
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invention is provided as an isolate, in isolated and/or 
purified form, or free or substantially free of material 
with which it is naturally associated, such as free or 
substantially free of nucleic acid flanking the gene in 
the human genome* except possibly one or more regulatory 
sequence (s) for expression. Nucleic acid may be wholly 
or partially synthetic and may include genomic DNA, cDNA 
or RNA. Where nucleic acid according to the invention 
includes RNA, reference to the sequence shown should be 
construed as reference to the RNA equivalent, with U 
substituted for T. 



Nucleic acid sequences encoding all or part of the GPI- 
PLD gene and/or its regulatory elements can be readily 
15 prepared by the skilled person using the information and 
references contained herein and techniques known in the 
art (for example, see Sambrook, Fritsch and Maniatis, 
"Molecular Cloning, A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, 1989, and Ausubel et al. Short 
20 protocols in Molecular Biology, John Wiley and Sons, 

1992) . These techniques include (i) the use of the 
polymerase chain reaction (PCR) to amplify samples of 
such nucleic acid, e.g. from genomic sources, (ii) 
chemical synthesis, or (iii) amplification in E. coll. 
25 Modifications to the GPI-PLD sequences can be made, e.g. 

using site directed mutagenesis, to provide expression of 
modified GPI-PLD protein or to take account of codon 
preference in the host cells used to express the nucleic 
acid. 

30 

in order to obtain expression of the GPI-PLD nucleic acid 
sequences, the sequences can be incorporated in a vector 
having control sequences operably linked to the GPI-PLD 
nucleic acid to control its expression. The use of 
35 expression systems has reached an advanced degree of 

sophistication. The vectors may include other sequences 
such as promoters or enhancers to drive the expression of 
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the inserted nucleic acid, nucleic acid sequences so that 
the GPI-PLD protein is produced as a fusion and/or 
nucleic acid encoding secretion signals so that the 
polypeptide produced in the host cell is secreted from 
5 the cell. GPI-pld protein can then be obtained by 

transforming the vectors into host cells in which the 
vector is functional, culturing the host cells so that 
the GPI-PLD protein is produced and recovering the GPI- 
PLD protein from the host cells or the surrounding 

10 medium. Prokaryotic and eukaryotic cells are used for 
this purpose in the art, including strains of E. coli, 
yeast, and eukaryotic cells such as COS or CHO cells. 
The choice of host cell can be used to control the 
properties of the GPI-PLD protein expressed in those 

15 cells, e.g. controlling where the polypeptide is 

deposited in the host cells or affecting properties such 
as its glycosylation and phosphorylation. 

PCR techniques for the amplification of nucleic acid are 

20 described in US Patent No: 4,683,195. In general, such 

techniques require that sequence information from the 
ends of the target sequence is known to allow suitable 
forward and reverse oligonucleotide primers to be 
designed to be identical or similar to the polynucleotide 

25 sequence that is the target for the amplification. PCR 

comprises steps of denaturation of template nucleic acid 
(if double- stranded) , annealing of primer to target, and 
polymerisation. The nucleic acid probed or used as 
template in the amplification reaction may be genomic 

30 DNA, cDNA or RNA. PCR can be used to amplify specific 

sequences from genomic DNA, specific RNA sequences and 
cDNA transcribed from mRNA, bacteriophage or plasmid 
sequences. The GPI-PLD protein nucleic acid sequences 
provided herein readily allow the skilled person to 

35 design PCR primers. References for the general use of 

PCR techniques include Mullis et al, Cold Spring Harbor 
Symp. Quant. Biol., 51:263, 1987; Ehrlich (ed) , PCR 
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Technology, Stockton Press, NY, 1989; Ehrlich et al, 
Science, 252:1643-1650, 1991; "PCR protocols; A Guide to 
Methods and Applications", Eds. innis et al, Academic 
Press, New York, 1990. 

5 

Nucleic acid according to the present invention is 
obtainable using one or more oligonucleotide probes or 
primers designed to hybridize with one or more fragments 
of the nucleic acid sequence shown in the figures, 
10 particularly fragments of relatively rare sequence, based 
on codon usage or statistical analysis. A primer 
designed to hybridize with a fragment of the nucleic acid 
sequence shown in the above figures may be used in 
conjunction with one or more oligonucleotides designed to 
15 hybridize to a sequence in a cloning vector within which 
target nucleic acid has been cloned, or in so-called 
-RACE" (rapid amplification of CDNA ends) in which cDNA's 
in a library are ligated to an oligonucleotide linker and 
PCR is performed using a primer which hybridizes with a 
20 GPI-PLD nucleic acid sequence shown in figures and a 

primer which hybridizes to the oligonucleotide linker. 

Such oligonucleotide probes or primers, as well as the 
full-length sequence (and mutants, alleles, variants and 
25 derivatives) are also useful in screening a test sample 
containing nucleic acid for the presence of alleles, 
mutants and variants, especially those that lead to the 
production of inactive forms of GPI-PLD protein protein, 
the probes hybridizing with a target sequence from a 
30 sample obtained from the individual being tested. The 

conditions of the hybridization can be controlled to 
minimise non-specific binding, and preferably stringent 
to moderately stringent hybridization conditions are 
preferred. The skilled person is readily able to design 
35 such probes, label them and devise suitable conditions 
for the hybridization reactions, assisted by textbooks 
such as Sambrook et al (1989) and Ausubel et al (1992). 
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Examples of "stringent conditions" are those which: (1) 
^ employ low ionic strength and high temperature for 

washing, for example 0.015 M sodium chloride/0.0015 M 
sodium citrate/0.1% sodium dodecyl sulphate at 50 °C; (2) 
5 employ during hybridisation a denaturing agent, such as 

formamide, for example, 50% (v/v) formamide with 0.1% 
BSA/0.1% Ficoll/0.1% polyvinylpyrrolidone/50mM sodium 
phosphate buffer at pH 6.5 with 750mM sodium chloride, 
75mM sodium citrate at 42°C; or (3) employ 50% formamide, 
10 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50mM 

sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x 
Denhardt's solution, sonicated salmon sperm DNA 
(50ug/ml) , 0.1% SDS, and 10% dextran sulphate at 42°C, 
with washes at 42°C in 0.2 x SSC and 50% formamide at 
15 55°C, followed by high stringency wash consisting of 0.1 

x SSC containing EDTA at 55°C. These hybridisation 
conditions may be used in the context of defining nucleic 
acid sequences which hybridize with GPI-PLD nucleic acid 
sequences . 
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Uses of GPI-PLD Nucleic Acid 

The GPI-PLD nucleic acid sequences can be used in the 
preparation of cell lines capable of expressing GPI-PLD 
and in gene therapy techniques . 



Thus, the present invention provides a cell line for 
transplantation into a patient, the cell line being 
transformed with nucleic acid encoding GPI-PLD, and being 
capable of expressing and secreting GPI-PLD. In one 
embodiment, the cell lines are encapsulated, e.g. in a 
biocompatible polymer, so that the GPI-PLD produced by 
the cell line can be secreted into the patient, while 
preventing rejection by the immune system of the host. 
Methods for encapsulating cells in biocompatible polymers 
35 are described in W093/16687 and W096/31199. 



a further alternative, the nucleic acid encoded the 
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GPI-PLD protein could be used in a method of gene 
therapy, to treat a patient who is unable to synthesize 
the active polypeptide or unable to synthesize it at the 
normal level, thereby providing the effect provided by 
wild-type GPI-PLD protein and suppressing the occurrence 
of liver dysfunction in the target cells. 

vectors such as viral vectors have been used in the prior 
art to introduce genes into a wide variety of different 
target cells. Typically, the vectors are exposed to the 
target cells so that transfection can take place in a 
sufficient proportion of the cells to provide a useful 
therapeutic or prophylactic effect from the expression of 
the desired polypeptide. The transfected nucleic acid 
may be permanently incorporated into the genome of each 
of the targeted tumour cells, providing long lasting 
effect, or alternatively the treatment may have- to be 
repeated periodically. 

A variety of vectors, both viral vectors and plasmid 
vectors, are known in the art, see US Patent No: 
5,252,479 and WO93/07282. In particular, a number of 
viruses have been used as gene transfer vectors, 
including papovaviruses , such as SV40, vaccinia virus, 
herpesviruses, including HSV and EBV, and retroviruses. 
Many gene therapy protocols in the prior art have used 
disabled murine retroviruses. 

As an alternative to the use of viral vectors other known 
methods of introducing nucleic acid into cells includes 
electroporation, calcium phosphate co -precipitation, 
mechanical techniques such as microinjection, transfer 
mediated by liposomes and direct DNA uptake and receptor - 
mediated DNA transfer. 

As mentioned above, the aim of gene therapy using nucleic 
acid encoding the GPI-PLD protein, or an active portion 
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thereof, is to increase the amount of the expression 
product of the nucleic acid in cells in which the level 
of the wild- type GPI-PLD protein is absent or present 
only at reduced levels. Target cells for gene therapy 
include insulin secreting (3- cells or any neuron derived 
cells* Cell engineering can be used to provide the 
overexpression or repression of GPI-PLD protein in 
transfected cell lines which can then be subsequently 
transplanted to humans . Gene therapy can be employed 
using a promoter to drive GPI-PLD protein expression in a 
tissue specific manner (i.e. an insulin promoter linked 
to GPI-PLD cDNA will overexpress GPI-PLD protein in p- 
cells and transiently in the brain) . If defective 
function of GPI-PLD protein is involved in neurological 
disease, GPI-PLD protein can be overexpressed in 
transformed cell lines for transplantation. 

Gene transfer techniques which selectively target the 
GPI-PLD nucleic acid to target tissues are preferred. 
Examples of this included receptor-mediated gene 
transfer, in which the nucleic acid is linked to a 
protein ligand via poly lysine, with the ligand being 
specific for a receptor present on the surface of the 
target cells. 

Diagnostic Methods 

Methods for determining the concentration of analytes in 
biological samples from individuals are well known in the 
art and can be employed in the context of the present 
invention to determine the presence or amount of GPI-PLD 
in a biological sample from a patient. This in turn can 
allow a physician to determine whether a patient suffers 
from liver dysfunction, and so optimise the treatment of 
it. 

Broadly, the methods divide into those screening for the 
presence of GPI-PLD protein nucleic acid sequences and 
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those that rely on detecting the presence or absence of 
the GPI-PLD protein polypeptide. The methods make use of 
biological samples from individuals that are suspected of 
contain the nucleic acid sequences or polypeptide. 

These diagnostic methods can employ biological samples 
such as blood, serum, tissue samples or urine. In view 
of the fact that the activity of GPI-PLD is thought to be 
due to the level of the enzyme circulating in serum, the 
use of serum or blood samples is preferred. 

The assay methods for determining the amount or 
concentration of GPI-PLD protein typically either employ 
binding agents having binding sites capable of 
specifically binding to GPI-PLD in preference to other 
molecules or measure a characteristic biological activity 
of GPI-PLD. Examples of binding agents include 
antibodies, receptors and other molecules capable of 
specifically binding the enzyme. Conveniently, the 
binding agent (s) are immobilised on solid support, e.g. 
at defined locations, to make them easy to manipulate 
during the assay. 

The sample is generally contacted with the binding 
agent (s) under appropriate conditions so that GPI-PLD 
present in the sample can bind to the binding agent (s). 
The fractional occupancy of the binding sites of the 
binding agent (s) can then be determined using a 
developing agent or agents. Typically, the developing 
agents are labelled (e.g. with radioactive, fluorescent 
or enzyme labels) so that they can be detected using 
techniques well known in the art. Thus, radioactive 
labels can be detected using a scintillation counter or 
other radiation counting device, fluorescent labels using 
a laser and confocal microscope, and enzyme labels by the 
action of an enzyme label on a substrate, typically to 
produce a colour change. The developing agent (s) can be 
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used in a competitive method in which the developing 
agent competes with the analyte for occupied binding 
sites of the binding agent, or non- competitive method, in 
which the labelled developing agent binds analyte bound 
5 by the binding agent or to occupied binding sites. Both 

methods provide an indication of the number of the 
binding sites occupied by the analyte, and hence the 
concentration of the analyte in the sample, e.g. by 
comparison with standards obtained using samples 
10 containing known concentrations of the analyte. 

Experiment a 1 

The present invention is based on the realisation that 
GPI-PLD can be used in the treatment of liver 

15 dysfunction, and in particular combination with 

apoliprotein Al to which it is bound in human serum and 
blood. As GPI-PLD is transported in blood complexed with 
apolipoprotein Al, liver dysfunction, and especially 
dysfunction characterised by reduced apolipoprotein Al 

20 levels, can be treated using GPI-PLD. 

Screening of human liver cDNA library 

A human liver cDNA library (Gibco BRL, cat # 10422-012, 
lot # HF4703) was screened for GPI-PLD, resulting in the 
25 isolation of 3 cDNA clones. The nucleic acid sequences 

of the clones are shown in figures 4 to 6, with the 
deduced amino acid sequences shown in figure 3. 

Clone al represents the full length cDNA. There are only 
30 two differences within the coding region of this sequence 

when compared to that of the human GPI-PLD pancreatic 
form described in the GenBank database (accession number 
L11702) . These are a g to a conversion at positions 88 
(L11702) , 199 (al) and a t to g conversion at positions 
35 797 (L11702), 908 (al). Interestingly this latter this 

latter conversion creates a unique Hindlll restriction 
site in the al clone. Both conversions result in amino 
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acid differences, the first changes amino acid 30 from a 
valine in L11702 to an isoleucine in al, and the second 
changes amino acid 266 from an isoleucine in L11702 to a 
serine in al. Clone al also differs from L11702 in that 
5 it contains 5' untranslated region (UTR) and only shares 

the first 168 bases of the 3' UTR before terminating in a 
poly-A tail . 

Clone b2 lacks the exon of GPI-PLD, which begins at 
10 position 2469 in the al nucleotide sequence- However, 

the sequence from here to the end of b2 (2444-2473) does 
not contain a stop codon. It is therefore not clear 
whether b2 represents a cDNA with a different final exon 
or is the produce of aberrant processing. 

15 

Clone d3 shared the coding 3' UTR sequence of the al 
clone from al position 1119 onwards, however the initial 
1008 base pairs of coding sequence are absent from this 
clone. Clone d3 contains a methionine initiation codon 

20 in frame to the coding sequence at position 202 and a 

unique 5' UTR. Translation of d3 from this codon would 
result in a unique sequence of 6 amino acids (1-6) . 
Clone d3 therefore appears to represent a true 
transcript, in that it contains initiation and stop 

25 codons and both 5' and 3' UTRs . The predicted protein 

product of this transcript would apparently lack the 
catalytic domain, which has been localised to the N- 
terminus of the GPI-PLD enzyme (amino acids 1-375), 
however the 3 EF hand -like domains would still be 

30 present. 

Huang et al and Tsang et al (1992) reported that two 
variants or isoenzymes of GPI-PLD exist, the so-called 
liver and pancreatic forms (accession numbers L11701 and 
35 11702) • Other workers have detected L11702 cDNAs in 

human breast, eye, spleen, tonsil, and pancreas, as well 
as in liver. However, we failed to detect the liver form 
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of GPI-PLD in the liver or in any other tissues. 
Gene mapping and localisation 

The chromosomal gene isolated in the experiments above is 
5 about 20-30 kb in length. The gene was also isolated on 

a PAC and mapped by fluorescence -in situ hybridisation 
(FISH) to 6p21.3, agreeing with recent radiation hybrid 
maps as seen on GeneMap'98; NCBI) . The IDDM1 
susceptibility gene also maps to 6p21.3, although recent 

10 evidence suggests that at least two closely- linked loci 

for IDDM1 are in the MHC region. The MHC locus itself 
seems to map to a region adjoining the GPI-PLD locus 
rather than within the same microsatellite band, so the 
significance of the proximity of the GPI-PLD and IDDMl 

15 loci is unclear. 

Northern blots of the mRNA species found in liver have 
shown two presumed splice variants as well as the full- 
length transcript. One has a deletion of about 160 amino 
20 acids from the mature 817 amino acid protein. The other 

seems to be a C- terminal deletion, which may therefore be 
non- functional if other authors are correct in finding 
that the C- terminus is necessary for enzyme activity. 

25 The predominant GPI-PLD species detected after tissue 

extraction by antibodies (Western blots) has apparent 
molecular weight of about 47 kD, which agrees with other 
authors that full-length GPI-PLD is taken up from the 
plasma and processed to smaller active species. 

30 

GPI-PLD obtained from serum by cells is required for 
second messenger signalling 

The principle goal of these experiments was to determine 
the role of glycosylphosphat idyl inositol phospholipase D 
35 (GPI-PLD) in a type one hypersensitivity reaction. This 

reaction involved the cross - linking of IgE receptors on 
the mast cell surface, leading to the release of allergic 
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mediators . 

Such an allergic reaction has been experimentally 
reproduced in our laboratory, using a rat basophilic 
5 leukaemia cell line, RBL-2H3. These cells naturally have 

unoccupied IgE receptors (FceRl, or high-affinity 
receptors) , allowing them to be passively sensitised with 
an IgE isotype of choice. 

10 RBL-2H3 cell culture was maintained in Eagles minimum 

essential medium, containing 10% Foetal Bovine Serum 
(FBS) (heat activated) , 100 U/ml Penicillin, 100 ug/ml 
Streptomycin and 2 mM L-glutamine. 

15 Previous research indicates that RBL-2H3 cells derive 

their GPI-PLD from the culture serum (data not shown) . 
Therefore, it follows that inactivation of this external 
source of GPI-PLD would deprive the cells of any further 
enzyme . 

20 

Inactivation of GPI-PLD activity in foetal bovine serum 
was achieved according to the method of Kung et al 
(Biochimica et Biophysica Acta, 1357:329-338, 1997). 
Briefly, FCS was adjusted to pH 11 using concentrated 

25 hydrochloric acid, and incubated for 1 hour at 37°C 

using . After this time, the pH was adjusted to 7.4, and 
GPI-PLD activity was determined using an enzymatic assay 
(Davitz et al, J. Biol. Chem. , 264:13760-13764, 1989). 
Results indicated that this alkaline incubation severely 

30 depleted GPI-PLD activity (data not shown) . 

To determine the effect of culture of RBL-2H3 cells in 
GPI-PLD inactive serum, the supplemented MEM was replaced 
with MEM in which the FBS had been inactivated. Although 
35 the cell appearance was not dramatically altered by the 

altered culture conditions, determination of GPI-PLD 
activity showed a dramatic reduction in activity. 
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GPI-PLD activity in cells cultured with GPI-pld 
active/inactive FBS: 

Active = 0.66 units GPI-PLD activity /mg of protein. 

5 

Inactive =0.11 units GPI-PLD activity /mg of protein. 

The effect of a reduced gpi-pld activity on the cell's 
ability to respond to igE cross -linking was determined 
10 follows: 
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35 



as 



RBL-2H3 cells were grown to confluence, after which time 
the adherent cells were removed from the culture flask 
using a cell scraper. The cell density was determined, 
15 using a haemocytometer, and adjusted to 2 x 10 5 per ml. 

The cells were seeded at 1 ml per well in a 24 well 
culture plate and cultured for overnight at 37 °c in a 
humidified 5% C0 2 incubator. 

20 The overnight culture media was aspirated and replaced 

with fresh media containing Rat IgE anti-DNP at 3 mg/ml. 
After a 2 hour incubation period, the media was 
aspirated, and the cells were washed twice, with hepes 
Buffered Saline. Cross -linking was achieved by the 
25 addition of 200 ul of DNP -Albumin at 100 ng/ml, and 

incubation for 2 hours. Mediator release was determined 
using a colorimetric assay to detect the presence of p- 
hexosaminidase and compared with the total cell 3- 
hexosaminidase content (as determined by incubation with 
200 yl 5% Triton x-100 detergent). (Yasuda et al, int. 
imunol., 7:251-258, 1995). As shown in the table below, 
the responsiveness to cross -linking was significantly 
reduced in those cells that were cultured in GPI-pld 
inactive media. 



Percentage release in IgE linking activity assay 
(compared with total) 
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Active GPI-PLD culture = 48.79% 

inactive GPI-PLD culture = 5.07% 

5 phosphorylation of GPI-PLD 

The phosphorylation state of the GPI-PLD enzymes can be 
determined using MALDI-TOF mass spectrometry as described 
by Yip & Hutchins (1992) . Spectrums of tryptic digests 
of the four proteins can be compared before and after 
10 treatment with calf intestinal alkaline phosphatase. The 

specific kinases responsible for phosphorylation of GPI- 
PLD can then be determined by incubation of the GPI-PLD 
tryptic fragments with ATP in the presence of various 
kinases. Motif analysis of the amino acid sequence of 
15 human GPI-PLD using the HGMP "motif" package has revealed 

the presence of numerous potential phosphorylation sites 
for two enzymes: protein kinase C and protein kinase ck2 
(formerly known as casine kinase II) . These enzymes may 
therefore be involved in the activation of GPI-PLD. 
20 intriguingly the activity of protein kinase ck2 has been 

shown to be modulated by IPGs (Alemany et al, 1990) and 
there is also indirect evidence suggesting that IPGs may 
act through protein kinase C, thus suggesting the 
possibility of feedback loops regulating the production 
25 of IPGs. 

GPI-PLD as a metal ion transferase 

Two families of IPGs exist. IPGs of the P-type stimulate 
incorporation of glucose into glycogen whereas the A -type 
30 ipgs stimulate incorporation of glucose into lipid. 

Metal ion analysis has shown that the P-type IPGs contain 
manganese and the A- type zinc. It is known that the 
serum form of GPI-PLD contains approximately 10 atoms of 
zinc per molecule, investigation can therefore show 
35 whether the different isoforms of human GPI-PLD produce 
IPGs with differing metal ion content. 
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This experiment can be performed in two ways. Firstly 
purified A-type and P-type IPGs can be extracted from rat 
liver (Caro et al, 1997) and their metal ions removed 
using dithiazone in chloroform. The IPGs can be 
5 incubated in the presence of radiosotopes of zinc < 65 Zn 2+ ) 

and manganese ( 52 Mn 2+ ) respectively. The radiolabelled 
IPGs can then be added to the different isoforms of 
purified GPI-PLD (as determined in the above experiments) 
in the absence of GPI substrate thus driving the reaction 

10 from product (IPG) to substrate (GPI) . It can then be 

determined whether or not the GPI-PLD protein have 
incorporated radioactive metal ions from the IPGs. The 
reverse situation will also be examined, whereby the 
metal ions of GPI-PLD isoforms are replaced by the 

15 respective radioisotopes. GPI-PLD can then be incubated 

with GPIs extracted from membrane preparations and the 
resulting IPG products analysed for incorporation of 
radioisotope. These experiments will thus determine 
whether or not GPI-PLD is responsible for the transfer of 

20 divalent cations (Mn 2+ or Zn 2+ ) to its IPG products. 

Site of action 

The function of the enzyme in releasing GPI -anchored 
proteins, and its postulated function as the generator of 
25 IPG second messengers require the enzyme to be active at 

the cell surface. It is known that GPI -anchored proteins 
accumulate in clusters in caveolae, an uncoated pit 
membrane specialisation, and so this is a good potential 
site for GPI-PLD activity. Analysis of the primary 
structure of the protein predicts a secondary structural 
arrangement of four amphipathic helices, thus suggesting 
that the protein can interact with lipids in membranes. 
Previous experiments have demonstrated significant 
amounts of the enzyme in the lyososmal fraction but not 
35 in the cytosol. The location of GPI-PLD will be examined 

by staining tissues with anti -GPI-PLD antibodies , 
followed by a gold particle -labelled second antibody. 
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Tissue can then be prepared for transmission electron 
microscopy and the location of the GPI-PLD protein 
determined. Caveolae will also be produced according to 
the protocol of Chang et al (1994), which involves three 
5 rounds of sucrose step gradient ultracentrifugation. 

Caveolae -enriched proteins will then be separated by SDS- 
PAGE and electrophorectically transferred to 
nitrocellulose membranes. We can then use the anti-GPl- 
PLD antibody to determine if GPI-PLD is present in these 
10 membrane specialisations. 

Activation of GPI-PLD 

If GPI-PLD is found to be phosphorylated by protein 
kinase C and/or protein kinase ck2 by MALDI-TOF 
15 spectrometry, the interaction of these proteins can be 

confirmed using immunoprecipitation since antibodies to 
GPI-PLD, protein kinase C and protein kinase ck2 have all 
be produced. The yeast two hybrid system can also be 
used to identify other proteins which interact with GPI- 
20 PLD in the cell. The yeast two hybrid systems (Chen et 

al, 1991) is based on the property of the yeast 
transcriptional activator Gal4, which is separable into 
DNA binding and transcriptional activating domains. GPI- 
PLD cDNAs can be cloned in frame into the DNA binding 
25 domain vector. This will be co- trans fee ted into an 

appropriate yeast host strain along with a library of 
cDNAs cloned into the activation domain vector, 
interaction of a protein with GPI-PLD will therefore 
result in localisation of the activation and DNA binding 
30 domains, and hence transcription of the galactosidase 

reporter gene. Clones containing interacting proteins 
are then identified by the colour reaction they produce. 
The advantage of this system is that the gene encoding 
the interacting protein is immediately available for 
35 sequence analysis and thus identification. The use of 

this system has enabled identification of many 
interacting proteins and the system available in kit form 
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from Clontech. This also provides a method of screening 
for sustances which are capable of activating GPI-PLD, 
e.g. for further development as lead compounds. 



5 Discussion 

GPI-PLD is a metalloenzyme with 5 and 10 atoms per 
molecule of calcium and zinc, respectively. It 
circulates in a complex with apolipoprotein Al . GPI-PLD 
is produced in the pancreas by both a and (J -cells in the 

10 islets of Langerhans. It is also produced by a mouse 

insulinoma cell line (TC3) , with GPI-PLD and insulin 
generally colocalised intracellularly . The enzyme was 
shown to be secreted in response to insulin 
secretagogues . Both isoenzymes of GPI-PLD also seem to 

15 be present in liver; a major part of the activity could 

be washed away from the tissue by extraction with 
detergent -free buffer (thus, likely to be the plasma 
enzyme) . There is some suggestions that the liver, as 
well as the pancreas, may contribute to the serum pool of 

20 GPI-PLD as patients with liver disease have lower levels 

of active enzyme, which is correlated with the reduced 
albumin levels. 
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Figure 1: Alignment of GPI-PLD deduced amino acid sequences 
Top: protein produced from cDNA clone Al 

Mid: protein produced from Roche patent bovine liver sequence 
Bot: protein produced from Roche patent human liver sequence 



MSAFRLWPGLLIMLG-SLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDA 
MSAFR FWS GLLMLLG- FLC PRSS PCGI STHIE I GHRALE FLHLQDGS INYKELLLRHQDA 
MS AFR LWP GL LMI VMAS LCHR GS SCGLSTHIE I GHRALE FLHLHNGHVNYKELLLEHQDA 

YQAGI VFPDC FY PS I CKGGKFHDVSE S THWT PFLNAS VH Y I RENY PLPWE KDTEKLVAFL 
YQAGSVFPDSFYPSICERGQFHDVSESTHWTPFLNASVHYIRKNYPLPWDEDTEKLVAFL 
YQAGTVFPDC F Y PS LCKGGKFHDVS E S THWT PFLNAS VHY I RENY PL PWEKDTE KLVAFL 

FGI TS HMAADVSWHS LGLEQGFLRTMGAI D FHGS Y SEAHS AGDFGGDVLS Q FE FNFNY LA 
FGITSHMVADVNWHSLGIENGFLRTMAAIDFHNSYPEAHPAGDFGGDVLSQFEFKFNYLS 
FGI TS HMVADVSWHS LGI E QGFLRTMGAI D FHGS YSEAHS AGDFGGDVLS Q FE FNFNY LA 

RRWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFL 
RHWYVPAEDLLGI YRELYGRI VI TKKAI VDCS YLQFLEMYAEMLAI SKLYPT YSVKS PFL 
RRWYVPVKDLLGIYEKLYGREVITENVIVDCSHIQFLEMYGEMLAVSKLY PSYSTKSPFL 

VEQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPENPLFIACGGQQNHTQG 

VEQFQEYFLGGLEDMAFWSTNIYHLTSTMLKNGTSNCNLPENP LFITCGGQQNNTHG 

VEQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCSLFENPENPLFIACGGQQNHTQG 

SKMQKNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFI YKALERNIRTMFIG 
SKVQKNGFHKNVTAALTKNIGKHINYTKRGVFFSVDSWTMDFLSFMYKSLERSIREMFIG 
SKMQKNDFHRNLTSSLTENIDRNINYTERGVFFSVNSWT PDSMSFIYKALERNVRTMFIG 

GSQLSQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLWGAPGYSRPGHIHIGRV 
SSQP-LTHVS SPAAS YYLSFPYTRLGWAMTSADLNQDGYGDLVVGAPGYSHPGRIHVGRV 
GSQLSQKHISSPLASYFLSFPYARLGWAMTSADLNQDGYGDLWGAPGYSRPGRIHIGRV 

YLIYGNDLGLPPVDLDLDKEAHRILEGFQPSGRFGS7VLAVLDFNVDGVPDLAVGAPSVGS 
YLIYGNDLG-PRIDLDLDKEAHGILEGFQPSGRFGSAVAVLDFNVDGVPDLAVGAPSVGS 
YLIYGNELGLPPVDLDLDKEAHGILEGFQPSGRFGSALAMLDFNMDGVPDLAVGAPSVGS 

EQLTYKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPD-LVIGSP 
EKLTYTGAVYVYFGSKQGQLSSSPNVTISCQDTYCNLGWTLLAADVDGDSEPDLFVIGSP 
EQLTYKGAVYVYFGSKQGRMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPD-LVIGSP 

FAPGGGKQKGI VAAFYS GPS LS DKEKLNVEAANWTVRGE EDFSWFGYS LHGVTVTDNRTLL 
FAFGGGKQKGIVAAFYSGSSYSSREKLNVEAANWMVKGEEDFAWLGYSLHGVNVNNRTLL 
FAPGGGKQKGI VAAFYSGPSLSNKEKLNVEAANWTVRGEEDFAWFGYSLHGVTVDNRTLL 

LVGSPTWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGH 
LAGS PTWKDTS SQGHLFRTRDEKQS PGRVYGY FPPI CQSWFTI SGDKAMGKLGTS LS S GH 

LVGSPTWKNASRLGRLLHIRDEKKSLGRVYGYFPPNSQSWFTIVGDKAMGKLGTSLSSGH 

VLMNGTLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRF 
VIVNGTRTQVLLVGAPTQDWSKS-FLTMTLHQGGSTRMYELTPDSQPSLLSTFSGNRRF 
VLMNGTLTQVLLVGAPTRDDVSKMAFLTMTLHQGGATRMYALTSDLQPPLLSTFSGDRRF 

SRFGGVLHLSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKC 
SRFGGVLHLSDLDNDGLDEIIVAAPLRITDATAGLMGEEDGRVYVFNGKQITVGDVTGKC 
SRFGGVLHLSDLDDDGVDEIIVAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKC 

KSWI T PC PE E KAQ Y VLI S P EAS S R FGS S LI TVRS KAKNQ WI AAGRS S LGARLS GALHVY 
KSWVTPCPEEKAQYVLISPEAGSRFGSSVITVRSKEKNQVIIAAGRSSLGARLSGVLHIY 
KSWMTPCPEEICAQYVL I SPEASSR FGS S LI TVRS KAKNQVVIAAGRSS LGARLS GALHVY 



SLGSD 
RLGQD 
SLGSD 
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Figure 2: Alignment of human GPI-PLD nucleic acid sequences 

Top: pancreatic- form cDNA sequence from GenBank database 
mid: our sequence cloned from human liver cDNA library 
bot: Roche patent pancreatic-form partial cDNA sequence 



1 GTGACCTGCTTAGAGAGAAGCGGTGGGTCTGCACCTGGATTTTGGAGTCCCAGTGCTGCT 60 

1 ATGTCTGCT 9 

61 GCAGCTCTGAGCXATTCCCACGTCACCAGAGAAGCCGGTGGGCAATGAGAGCATGTCTGCT 120 

10 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 69 

121 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 180 

70 TGTGGCCTTTCAACACACGTAGAAATAGGACACAGAGCTCTGGAGTTTCTTCAGCTTCAC 12 9 

181 TGTGGCCTTTCAACACACATAGAAATAGGACACAGAGCTCTGGAGTTTCTTCAGCTTCAC 240 

130 AATGGGCGTGTTAACTACAGAGAGCTGTTACTAGAACACCAGGATGCGTATCAGGCTGGA 189 

241 AATGGGCGTGTTAACTACAGAGAGCTGTTACTAGAACACCAGGATGCGTATCAGGCTGGA 300 

190 ATCGTGTTTCCTGATTGTTTTTACCCTAGCATCTGCAAAGGAGGAAAATTCCATGATGTG 249 

301 A ^ G ^ TTCCTGATTGTTOT ^ 360 

250 TCTGAGAGCACTCACTGGACTCCGTTTCTTAATGCAAGCGTTCATTATATCCGAGAGAAC 309 

361 TCTGAGAGCACTCACTGGACTCCGTTTCTTAATGCAAGCGTTCATTATATCCGAGAGAAC 420 

310 TATCCCCTTCCCTGGGAGAAGGACACAGAGAAACTGGTAGCTTTCTTGTTTGGAATTACT 369 

421 TATCCCCTTCCCTGGGAGAAGGACACAGAGAAACTGGTAGCTTTCTTGTTTGGAATTACT 480 

370 TCTCACATGGCGGCAGATGTCAGCTGGCATAGTCTGGGCCTTGAACAAGGATTCCTTAGG 429 

481 TCT^C^TGGCGGCAGATGTCAGCTGGCATAGTCTGGGCCTTGAACAAGGATTCCTTAGG 540 

430 ACCATGGGAGCTATTGATTTTCACGGCTCCTATTCAGAGGCTCATTCGGCTGGTGATTTT 489 

541 ACCATGGGAGCTATTGATTTTCACGGCTCCTATTCAGAGGCTCATTCGGCTGGTGATTTT 600 

490 GGAGGAGATGTGTTGAGCCAGTTTGAATTTAATTTTAATTACCTTGCACGACGCTGGTAT 549 

601 GGAGGAGATGTGTTGAGC^GTTTGAATTTAATTTTAATTACCTTGCACGACGCTGGTAT 660 

550 609 



661 GTGCCAGTCAAAGATCTACTGGGAATTTATGAGAAACTGTATGGTCGAA^GTCAT^CC 



720 



610 I?!^*™^^ 669 



721 GAAAATGTAATCGTTGATTGTTCACATATCC^GTTCTTAGAAATGTATGGTGAGATGCTA 



780 



670 ^'SZiiiJiS^^ 729 



781 GCTGTTTCCAAGTTATATCCCACTTACTCTACAAAGTCCCCGTTTTTGGTGGAACAATTC 



840 



4 • 4-/IS • 



730 CAAGAGTATTTTCTTGGAGGACTGGATGATATGGCATTTTGGTCCACTAATATTTACCAT 78 9 
841 CAAGAGTATTTTCTTGGAGGACTGGATGATATGGCATTTTGGTCCACTAATATTTACCAT 900 



7 90 CTAACAAT CTT CAT GTT GGAGAAT GGGACCAGT GACT GCAAC CT GC CT GAGAACCCT CT G 8 49 
901 CTAACAAGCTT CAT GTT GGAGAAT GGGACCAGT GACT GCAAC CTGC CT GAGAACCCT CT G 960 



850 T T CAT T GCAT GT GGC GGC CAGCAAAAC CAC AC C CAGGGCT CAAAAAT GC AGAAAAAT GAT 909 
961 T T CAT T GCAT GT GGC GGC CAGCAAAAC C ACAC C CAGGGCT CAAAAAT GCAGAAAAAT GAT 1020 



910 TTTCACAGAAATTTGACTACATCCCTAACTGAAAGTGTTGACAGGAATATAAACTATACT 969 
1021 TTTCACAGAAATTTGACTACATCCCTAACTGAAAGTGTTGACAGGAATATAAACTATACT 1080 



970 GAAAGAGGAGTGTTCTTTAGTGTAAATTCCTGGACCCCGGATTCCATGTCCTTTATCTAC 1029 
1081 GAAAGAGGAGT GTT CTTTAGT GTAAATTC CTGGAC CCC GGATT C CAT GT CCTTTATCTAC 1140 



1030 AAGGCTTT GGAAAGGAACATAAGGACAAT GTT CATAGGTGGCT CT CAGTTGT CACAAAAG 1089 
1141 AAGGCTTT GGAAAGGAACATAAGGACAAT GTT CATAGGT GGCT CT CAGTTGT CACAAAAG 1200 



1090 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 1149 
1201 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 1260 



1150 GCAAT GAC CT CAGCT GAC CT CAACCAGGAT GGGCAC GGT GAC CT CGT GGT GGGC GCAC CA 1209 
1261 GCAAT GACCT CAGCT GAC CT CAAC CAGGAT GGGCAC GGT GAC CT C GT GGT GGGC GCAC CA 132 0 



1210 GGCTACAGCCGC CC CGGC CACAT CCACAT CGGGC GC GT GTAC CT CAT CTAC GGCAAT GAC 1269 
1321 GGCTACAGCCGCCC CGGC CACAT CCACAT C GGGC GC GT GTAC CT CAT CTAC GGCAAT GAC 138 0 



127 0 CT GGGC CT GC CACCTGTT GACCT GGAC CT GGACAAGGAGGC C CACAGGAT CCTT GAAGGC 1329 
1381 CT GGGCCTGCCACCT GTT GAC CT GGAC CT GGACAAGGAGGC CCACAGGAT CCTT GAAGGC 1440 



1330 TTCCAGCCCTCAGGTCGGTTTGGCTCGGCCTTGGCTGTGTTGGACTTTAACGTGGACGGC 138 9 
1441 TTCCAGCCCTCAGGTCGGTTTGGCTCGGCCTTGGCTGTGTTGGACTTTJ\ACGTGGACGGC 1500 



1390 GTGCCTGACCTGGCCGTGGGAGCTCCCTCGGTGGGCTCCGAGCAGCTCACCTACAAAGGT 144 9 
1501 GTGCCTGACCTGGCCGTGGGAGCTCCCTCGGTGGGCTCCGAGCAGCTCACCTACAAAGGT 1560 



1450 GCC GT GTAT GT CTACTTT GGTT CGAAACAAGGAGGAAT GT CTT CTT C C C CTAACAT CACC 1509 
1561 GCCGTGTATGTCTACTTTGGTTCCAAACAAGGAGGAATGTCTTCTTCCCCTAACATCACC 1620 



1510 ATTT CTT GCCAGGACATCTACTGTAACTTGGGCTGGACTCT CTT GGCT GCAGATGTGAAT 1569 
1621 ATTT CTT GCCAGGACATCTACTGTAACTTGGGCTGGACTCT CTT GGCT GCAGATGTGAAT 168 0 



1570 GGAGACAGT GAACC C GAT CT GGT CAT C GGCT CCCCTTTT GCACCAGGT GGAGGGAAGCAG 162 9 
1681 GGAGACAGTGAACCCGATCTGGTCATCGGCTCCCCTTTTGCACCAGGTGGAGGGAAGCAG 1740 



1630 AAGGGAATTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGACAAAGAAAAACTGAAC 168 9 
1741 AAGGGAATTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGACAAAGAA7VAACTGAAC 1800 
1 CTGGCCCCAGCCTGAGCGACAAAGAAAAACTGAAC 35 
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1690 GT GGAGGCAGC CAAC T GGAC GGT GAGAGGC GAGGAAGAC T T C T C C T GGT T T GGAT AT T C C 1749 
1801 GT GGAGGCAGC GAACT GGAC GGT GAGAGGC GAGGAAGACTTCT C CT GGTTT GGAT ATT C C 1860 
36 GTGGAGGCAGCCAACTGGACGGTGAGAGGCGAGGAAGACTTCTCCTGGTTTGGATATTCC 95 

1750 CTTCACGGTGTCACTGTGGACAACAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 1809 
1861 CTTCACGGTGTCACTGTGGACAACAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 1920 
96 CTTCACGGTGTCACTGTGGACAACAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 155 

1810 AATGCCAGCAGGCTGGGCCATTTGTTACACATCCGAGATGAGAAAAAGAGCCTTGGGAGG 1869 
1921 AATGCCAGCAGGCTGGGCCATTTGTTACACATCCGAGATGAGAAAAAGAGCCTTGGGAGG 1980 
156 AAT GC CAGCAGGC T GGGC CATT T GT T ACACAT C C GAGAT GAGAAAAAGAGC CT T GGGAGG 215 

187 0 GTGTATGGCTACTTCCCACCAAACGGCCAAAGCTGGTTTACCATTTCTGGAGACAAGGCA 1929 
1981 GTGTATGGCTACTTCCGACCAAACGGCCAAAGCTGGTTTACGATTTCTGGAGACAAGGCA 2040 
216 GTGTATGGCTACTTCC-ACCAAACGGCCAAAGCTGGTTTACCATTTCTGGAGACAAGGCA 275 

1930 ATGGGGAAACTGGGTACTTCCCTTTCCAGTGGCCACGTACTGATGAATGGGACTCTGAAA 1989 
2041 ATGGGGAAACTGGGTACTTCCCTTTCCAGTGGCCACGTACTGATGAATGGGACTCTGAAA 2100 
27 6 ATGGGGAAACTGGGTACTTCCCTTTCCAGTGGTCACGTACTGATGAATGGGACTCTGAAA 335 

1990 CAAGT GCTGCTGGTTGGAGC CCCTAC GTACGAT GAC GT GT CTAAGGT GGCATT C CT GACC 2049 
2101 CAAGTGCTGCTGGTTGGAGCCCCTACGTACGATGACGTGTCTAAGGTGGCATTCCTGACC 2160 
336 CAAGTGCTGCTGGTTGGAGCCCCTACGTACGATGACGTGTCTAAGGTGGCATTCCTGACC 395 

2050 GTGACCCTACAC CAAGGCGGAGCCACTC GCAT GTACGCACT CATATCTGACGC GCAGCCT 2109 
2161 GTGACCCTACAC CAAGGC GGAGC CACT C GCAT GTACGCACTCATATCT GACGC GCAGC CT 2220 
396 GTGACC CTACAC CAAGGC GGAGC CACT C GCGT GTAC GCACT CATATCT GAC GC GCAGCCT 455 

2110 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 2169 
2221 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 2280 
456 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 515 

2170 TTGAGT GAC CT GGAT GATGATGGCTT AGAT GAAAT CAT CAT GGCAGC CCC CCT GAGGATA 2229 
2281 TT GAGT GAC CTGGAT GAT GAT GGCTTAGAT GAAAT CAT CAT GGCAGC C C CCCT GAGGATA 2340 
516 TTGAGT GAC CTGGAT GAT GAT GGCTTAGAT GAAAT CAT CAT GGCAGCC CC C CT GAGGATA 575 

2230 GCAGAT GTAAC CT CTGGACT GATT GGGGGAGAAGAC GGC CGAGTATAT GTATATAAT GGC 2289 
2341 GCAGAT GTAAC C T CT GGAC T GAT T GGGGGAGAAGAC GGC C GAGT AT AT GTATATAAT GGC 2400 
576 GCAGAT GTAACCT CT GGACT GATT GGGGGAGAAGAC GGCC GAGTATAT GTATATAAT GGC 635 

2290 AAAGAGAC CACCCTT GGT GACATGACT GGCAAAT GCAAAT CAT GGATAACT CCATGT CCA 2349 
2401 AAAGAGAC CAC CCTT GGT GACAT GACT GGCAAAT GCAAAT CAT GGATAACTCCATGT CCA 2460 
636 AAAGAGAC CAC CCTT GGT GACAT GAC T GGCAAAT GCAAAT CAT GGATAACT C CAT GT C CA 695 

2350 GAAGAAAAGGCCCAATATGTATTGATTTCTCCTGAAGCCAGCTCAAGGTTTGGGAGCTCC 2409 
2461 GAAGAAAAGGCCCAATAT GTATT GATTT CTC CT GAAGC CAGCT CAAGGTTT GGGAGCT C C 2520 
696 GAAGAAAAGGCGCAATATGTATTGATTTCTCCTGAAGCCAGCTCAAGGTTTGGGAGCTCC 755 

2410 CTCATCACCGTGAGGTCCAAGGCAAAGAACCAAGTCGTCATTGCTGCTGGAAGGAGTTCT 2469 
2521 CTCATCACCGTGAGGTCCAAGGCAAAGAACCAAGTCGTCATTGCTGCTGGAAGGAGTTCT 258 0 
756 CTCATCACCGTGAGGTCCAAGGGAAAGAACCAAGTCGTCATTGCTGCTGGAAGGAGTTCT 815 

2470 TTGGGAGCCCGACTCTCCGGGGCACTTCACGTCTATAGCCTTGGCTCAGATTGAAGATTT 2529 
2581 TTGGGAGCCCGACTCTCCGGGGCACTTCACGTCTATAGCCTTGGCTCAGATTGAAGATTT 2640 
816 TTGGGAGCCCGACTCTCCGGGGCACTTCACGTCTATAGCCTTGGCTCAGATTGAAGATTT 875 

2530 CACTGCATTTCCCCACTCTGCCCACCTCTCTCATGCTGAATCACATCCATGGTGAGCATT 2589 
2641 CACTGCATTTCCCCACTCTGCCCACCTCTCTCATGCTGAATCACATCCATGGTGAGCATT 2700 
876 CACT GCATTT C C CCACT CT GC C CACCT CT CT CAT GCT GAAT CACAT C CAT GGT GAGCATT 935 

2590 TT GAT GGACAAAGT GGCACATCCAGT GGAGC GGT GGT AGAT C CTGATAGACAT GGGGCT C 2649 
2701 TTGATGGACAAAGTGGCACATCCAGTGGAGCGGTGGTAGATCCTGATAGACATGGGGCTC 2760 
936 T T GAT GGACAAAGT GGCACAT C CAGT GGAGC GGT GGT AGAT CCT GAT AGACAT GGGGCT C 995 



2.4- 



2 650 CTGGGA 2655 

2761 CTGGGA 2766 

996 CTGGGACAGTG7\ACCCGJ\TCTGGTC^TCGGCTCCCCTTTTGCACCAGGTGGAGGGAAGCA 1055 

2656 GT AGAGAGACAC AC T AACAGC CACAC C C T C T G 2687 

2767 GT AGAGAGACAC AC T AACAGC CACAC C C T C T G 27 98 

1056 GAAGGGAATTGTGGCTGCGTTTTATTGAGTAGAGAGACACACTAACAGCCACACCCTCTG 1115 

2 68 8 GAAAT CT GATACAGTAAATATAT GACT GCACCAGAAATAT GT GAAATAGCAGACATT CT G 2747 

27 99 GAAAT C T GATACAGTAAATATAT GACT GCAC C AG 2833 

1116 GAAAT CT GATACAGTAAATATAT GACT ACAC C AGAAAT AT GT GAAAT AGCAGACAT T C T G 1175 

2748 CTTACTCATGTCTCCTTCCACAGTTTACTTCCTCGCTCCCTTTGCATCTAAACCTTTCTT 2 807 

1176 CTTACTCATGTCTCCTTCCACAGTTTACTTCCTCGCTCCCTTTGCATCTAAACCTTTCTT 1235 

2808 CTTTCCC7\ACTTATTGCCTGTAGTCAGACCTGCTGTACAACCTATTTCCTCTTCCTCTTG 2867 

1236 CTTTCCCAACTTATTGCCTGTAGTC 1261 

28 68 AATGTCTTTCCAGTGGCTGGAAAGGTCCCTCTGTGGTTATCTGTTAGAACAGTCTCTGTA 2 927 



2928 CACAATTCCTCCTAAAAACATCCTTTTTTAA/\AAAAGAATTGTTCAGCCATAAAGAAAGA 2987 
2 988 ACAAGAT CAT GC C CTT T GCAGGGACAT GGAT GGAGCT GGAGGC CATTAT C CTT CATAAAC 3 047 
3048 TATTGGAGGAAGAGAAAACCAAACACTCC71TATTCTGACTTGTAAGTGGGAGCTAAGTGA 3107 
3108 GAACAC GT GGACAC AT AGAGGGAAAC AACAC AC AC T GGGGC C TAT GAGAGGGC GGAAGGT 3167 
3168 GGGAGGAGGGAGAGATCAGGAAAAATAACTAATGGATACTTAGGGTGATGAAATAATCTG 3227 
3228 T GTAACAAAC C C CCAT GACACAC CTTTAT GT AT GTAACAAACCAGCACTT CCTGC GCAT G 3287 
3288 TAC C C CT GAACTTAAAAGTTAAAAAAAAGTT GAACTTAAAAATAAGAGATTGGC C CAT GC 3347 



3348 CAAT CAAAGTATAATAGAAAGCAT AGT ATAC 337 8 



Figure 3: Amino acid sequences of GPI-PLD al, b2 and d3 . 



cDNA clone <13 

MILLFQDSMSFIYKALERNIRTMFIGGSQLSQKHVSSPIJ^YFLSFPYARLGWAMTSADL 
NQDGHGDLWGAPGYSRPGHIHIGRVYLIYGNDLGLPPVDLDLDKEAHRILEGFQPSGRF 
GSAIAVLDFNVDGVPDIAVGAPSVGSEQLTKGAVYVYFGSKQGC»4SSSPNITISCQDIYC 
NLGWTLLAADVNGDS E PDLVI GS PFAPGGGKQKGI VAAFYS GPSLSDKEKLNVEAANWTV 
RGEEDFSWFGYSLHGVTVDNRTLLLVGSPTWKNASRLGHLLHIRDEKKSLGRVYGYFPPN 
GQSWFTISGDKAMGKLGTSLSSGHVLMNGTLKQVLLVGAPTYDDVSKVAFLTVTLHQGGA 
TRMYALISDAQPLLLSTFSGDRRFSRFGGVLHLSDLDDDGLDEIIMAAPLRIADVTSGLI 
GGEDGRVYVYNGKETTLGDMTGKCKSWITPCPEEKAQYVLISPEASSRFGSSLITVRSKA 
KNQVVIAAGRSSLGARLSGALHVYSLGSD 



cDNA clone b2 

MSAFRLWPGLLIMLGSLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDAY 
QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF 
GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR 
RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV 
EQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ 
KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKALERNIRTMFIGGSQL 
SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLWGAPGYSRPGHIHIGRVYLIY 
GNDLGLP PVDLDLDKEAHR I LE GFQ PS GR FGS ALAVLD FNVDGVPDLAVGAPS VGS EQLT 
YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLIaAADVNGDSE PDLVI GSPFAPGG 
GKQKGI VAAFYS GPS LSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 
TWKNAS RLGHLLHI RDE KKS LGRVYGY FP PNGQSWFT I S GDKAMGKLGTS LS S GHVLMNG 
TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG 

VLHLSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWIT 
PCPEEKVSEKKKKKK 



cDNA clone al 

MSAFRLWPGLLIMLGSLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDAY 
QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF 
GITSHMA7UDVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR 
RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV 
EQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ 
KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKALERNIRTMFIGGSQL 
SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLWGAPGYSRPGHIHIGRVYLIY 
GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT 
YKGAVYVY FGS KQGGMS S S PNI T I S CQDI Y CNLGWTLLAADVNGDSE PDLVI GS PFAPGG 

GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 
TWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSS GHVLMNG 
TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG 
VLHLSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWIT 
PCPEEKAQYVLISPEASSRFGSSLITVRSKAKNQWIAAGRSSLGARLSGALHVYSLGSD 






ure 4: Human GPI-PLD cDNA clone al 



32 bp: 690 a 



688 c 



735 g 



719 t 



1 gtgacctgct tagagagaag cggtgggtct gcacctggat tttggagtcc cagtgctgct 
61 gcagctctga gcattcccac gtcaccagag aagccggtgg gcaatgagag catgtctgct 
121 ttcaggttgt ggcctggcct gctgatcatg ttgggttctc tctgccatag aggttcaccg 
181 tgtggccttt caacacacat agaaatagga cacagagctc tggagtttct tcagcttcac 
241 aatgggcgtg ttaactacag agagctgtta ctagaacacc aggatgcgta tcaggctgga 
301 atcgtgtttc ctgattgttt ttaccctagc atctgcaaag gaggaaaatt ccatgatgtg 
361 tctgagagca ctcactggac tccgtttctt aatgcaagcg ttcattatat ccgagagaac 
421 tatccccttc cctgggagaa ggacacagag aaactggtag ctttcttgtt tggaattact 
481 tctcacatgg cggcagatgt cagctggcat agtctgggcc ttgaacaagg attccttagg 
541 accatgggag ctattgattt tcacggctcc tattcagagg ctcattcggc tggtgatttt 
601 ggaggagatg tgttgagcca gtttgaattt aattttaatt accttgcacg acgctggtat 
661 gtgccagtca aagatctact gggaatttat gagaaactgt atggtcgaaa agtcatcacc 
721 gaaaatgtaa tcgttgattg ttcacatatc cagttcttag aaatgtatgg tgagatgcta 
781 gctgtttcca agttatatcc cacttactct acaaagtccc cgtttttggt ggaacaattc 
841 caagagtatt ttcttggagg actggatgat atggcatttt ggtccactaa tatttaccat 
901 ctaacaagct tcatgttgga gaatgggacc agtgactgca acctgcctga gaaccctctg 
961 ttcattgcat gtggcggcca gcaaaaccac acccagggct caaaaatgca gaaaaatgat 
1021 tttcacagaa atttgactac atccctaact gaaagtgttg acaggaatat aaactatact 
1081 gaaagaggag tgttctttag tgtaaattcc tggaccccgg attccatgtc ctttatctac 
1141 aaggctttgg aaaggaacat aaggacaatg ttcataggtg gctctcagtt gtcacaaaag 
1201 cacgtctcca gccccttagc atcttacttc ttgtcatttc cttatgcgag gcttggctgg 
1261 gcaatgacct cagctgacct caaccaggat gggcacggtg acctcgtggt gggcgcacca 
1321 ggctacagcc gccccggcca catccacatc gggcgcgtgt acctcatcta cggcaatgac 
1381 ctgggcctgc cacctgttga cctggacctg gacaaggagg cccacaggat ccttgaaggc 
1441 ttccagccct caggtcggtt tggctcggcc ttggctgtgt tggactttaa cgtggacggc 
1501 gtgcctgacc tggccgtggg agctccctcg gtgggctccg agcagctcac ctacaaaggt 
1561 gccgtgtatg tctactttgg ttccaaacaa ggaggaatgt cttcttcccc taacatcacc 
1621 atttcttgcc aggacatcta ctgtaacttg ggctggactc tcttggctgc agatgtgaat 
1681 ggagacagtg aacccgatct ggtcatcggc tccccttttg caccaggtgg agggaagcag 
1741 aagggaattg tggctgcgtt ttattctggc cccagcctga gcgacaaaga aaaactgaac 
1801 gtggaggcag ccaactggac ggtgagaggc gaggaagact tctcctggtt tggatattcc 
1861 cttcacggtg tcactgtgga caacagaacc ttgctgttgg ttgggagccc gacctggaag 
1921 aatgccagca ggctgggcca tttgttacac atccgagatg agaaaaagag ccttgggagg 
1981 gtgtatggct acttcccacc aaacggccaa agctggttta ccatttctgg agacaaggca 
2041 atggggaaac tgggtacttc cctttccagt ggccacgtac tgatgaatgg gactctgaaa 
2101 caagtgctgc tggttggagc ccctacgtac gatgacgtgt ctaaggtggc attcctgacc 
2161 gtgaccctac accaaggcgg agccactcgc atgtacgcac tcatatctga cgcgcagcct 
2221 ctgctgctca gcaccttcag cggagaccgc cgcttctccc gatttggtgg cgttctgcac 
2281 ttgagtgacc tggatgatga tggcttagat gaaatcatca tggcagcccc cctgaggata 
2341 gcagatgtaa cctctggact gattggggga gaagacggcc gagtatatgt atataatggc 
2401 aaagagacca cccttggtga catgactggc aaatgcaaat catggataac tccatgtcca 
24 61 gaagaaaagg cccaatatgt attgatttct cctgaagcca gctcaaggtt tgggagctcc 
2521 ctcatcaccg tgaggtccaa ggcaaagaac caagtcgtca ttgctgctgg aaggagttct 
2581 ttgggagccc gactctccgg ggcacttcac gtctatagcc ttggctcaga ttgaagattt 
2641 cactgcattt ccccactctg cccacctctc tcatgctgaa tcacatccat ggtgagcatt 
2701 ttgatggaca aagtggcaca tccagtggag cggtggtaga tcctgataga catggggctc 
2761 ctgggagtag agagacacac taacagccac accctctgga aatctgatac agtaaatata 
2821 tgactgcacc ag 



Figure 5: Human GPI-PLD cDNA clone b2 
2472 bp: 617 a 588 c 639 g 628 t 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 



gtctgcacct 

agagaagccg 

catgttgggt 

aggacacaga 

gttactagaa 

tagcatctgc 

tcttaatgca 

agagaaactg 

gcatagtctg 

ctcctattca 

atttaatttt 

ttatgagaaa 

tatccagttc 

ctctacaaag 

tgatatggca 

gaccagtgac 

ccacacccag 

aactgaaagt 

ttcctggacc 

aatgttcata 

cttcttgtca 

ggatgggcac 

catcgggcgc 

cctggacaag 

ggccttggct 

ctcggtgggc 

acaaggagga 

cttgggctgg 

cggctcccct 

tggccccagc 

aggcgaggaa 

aaccttgctg 

acacatccga 

ccaaagctgg 

cagtggccac 

gtacgatgac 

tcgcatgtac 

ccgccgcttc 

agatgaaatc 

gggagaagac 

tggcaaatgc 



ggattttgga 
gtgggcaatg 
tctctctgcc 
gctctggagt 
caccaggatg 
aaaggaggaa 
agcgttcatt 
gtagctttct 
ggccttgaac 
gaggctcatt 
aattaccttg 
ctgtatggtc 
ttagaaatgt 
tccccgtttt 
ttttggtcca 
tgcaacctgc 
ggctcaaaaa 
gttgacagga 
ccggattcca 
ggtggctctc 
tttccttatg 
ggtgacctcg 
gtgtacctca 
gaggcccaca 
gtgttggact 
tccgagcagc 
atgtcttctt 
actctcttgg 
tttgcaccag 
ctgagcgaca 
gacttctcct 
ttggttggga 
gatgagaaaa 
tttaccattt 
gtactgatga 
gtgtctaagg 
gcactcatat 
tcccgatttg 
atcatggcag 
ggccgagtat 
aaatcatgga 



aaaaaaaaaa aa 



gtcccagtgc 

agagcatgtc 

atagaggttc 

ttcttcagct 

cgtatcaggc 

aattccatga 

atatccgaga 

tgtttggaat 

aaggattcct 

cggctggtga 

cacgacgctg 

gaaaagtcat 

atggtgagat 

tggtggaaca 

ctaatattta 

ctgagaaccc 

tgcagaaaaa 

atataaacta 

tgtcctttat 

agttgtcaca 

cgaggcttgg 

tggtgggcgc 

tctacggcaa 

ggatccttga 

ttaacgtgga 

tcacctacaa 

cccctaacat 

ctgcagatgt 

gtggagggaa 

aagaaaaact 

ggtttggata 

gcccgacctg 

agagccttgg 

ctggagacaa 

atgggactct 

tggcattcct 

ctgacgcgca 

gtggcgttct 

cccccctgag 

atgtatataa 

taactccatg 



tgctgcagct 

tgctttcagg 

accgtgtggc 

tcacaatggg 

tggaatcgtg 

tgtgtctgag 

gaactatccc 

tacttctcac 

taggaccatg 

ttttggagga 

gtatgtgcca 

caccgaaaat 

gctagctgtt 

attccaagag 

ccatctaaca 

tctgttcatt 

tgattttcac 

tactgaaaga 

ctacaaggct 

aaagcacgtc 

ctgggcaatg 

accaggctac 

tgacctgggc 

aggcttccag 

cggcgtgcct 

aggtgccgtg 

caccatttct 

gaatggagac 

gcagaaggga 

gaacgtggag 

ttcccttcac 

gaagaatgcc 

gagggtgtat 

ggcaatgggg 

gaaacaagtg 

gaccgtgacc 

gcctctgctg 

gcacttgagt 

gatagcagat 

tggcaaagag 

tccagaagaa 



ctgagcattc 

ttgtggcctg 

ctttcaacac 

cgtgttaact 

tttcctgatt 

agcactcact 

cttccctggg 

atggcggcag 

ggagctattg 

gatgtgttga 

gtcaaagatc 

gtaatcgttg 

tccaagttat 

tattttcttg 

agcttcatgt 

gcatgtggcg 

agaaatttga 

ggagtgttct 

ttggaaagga 

tccagcccct 

acctcagctg 

agccgccccg 

ctgccacctg 

ccctcaggtc 

gacctggccg 

tatgtctact 

tgccaggaca 

agtgaacccg 

attgtggctg 

gcagccaact 

ggtgtcactg 

agcaggctgg 

ggctacttcc 

aaactgggta 

ctgctggttg 

ctacaccaag 

ctcagcacct 

gacctggatg 

gtaacctctg 

accacccttg 

aaggtaagtg 



ccacgtcacc 
gcctgctgat 
acatagaaat 
acagagagct 
gtttttaccc 
ggactccgtt 
agaaggacac 
atgtcagctg 
attttcacgg 
gccagtttga 
tactgggaat 
attgttcaca 
atcccactta 
gaggactgga 
tggagaatgg 
gccagcaaaa 
ctacatccct 
ttagtgtaaa 
acataaggac 
tagcatctta 
acctcaacca 
gccacatcca 
ttgacctgga 
ggtttggctc 
tgggagctcc 
ttggttccaa 
tctactgtaa 
atctggtcat 
cgttttattc 
ggacggtgag 
tggacaacag 
gccatttgtt 
caccaaacgg 
cttccctttc 
gagcccctac 
gcggagccac 
tcagcggaga 
atgatggctt 
gactgattgg 
gtgacatgac 
aaaaaaaaaa 
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Figure 6: Human GPI-PLD cDNA clone d3 
1942 bp: 455 a 496 c 502 g 489 t 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 



gggctgtaac 
cctgtcctaa 
ttctaaaaac 
ttgattatgc 
ctttatctac 
gtcacaaaag 
gcttggctgg 
gggcgcacca 
cggcaatgac 
ccttgaaggc 
cgtggacggc 
ctacaaaggt 
taacatcacc 
agatgtgaat 
agggaagcag 
aaaactgaac 
tggatattcc 
gacctggaag 
ccttgggagg 
agacaaggca 
gactctgaaa 
attcctgacc 
cgcgcagcct 
cgttctgcac 
cctgaggata 
atataatggc 
tccatgtcca 
tgggagctcc 
aaggagttct 
ttgaagattt 
ggtgagcatt 
catggggctc 
agtaaatata 



tctgccatcc 
gtgatcttac 
tcatttcctt 
tacgtgtctt 
aaggctttgg 
cacgtctcca 
gcaatgacct 
ggctacagcc 
ctgggcctgc 
ttccagccct 
gtgcctgacc 
gccgtgtatg 
atttcttgcc 
ggagacagtg 
aagggaattg 
gtggaggcag 
cttcacggtg 
aatgccagca 
gtgtatggct 
atggggaaac 
caagtgctgc 
gtgaccctac 
ctgctgctca 
ttgagtgacc 
gcagatgtaa 
aaagagacca 
gaagaaaagg 
ctcatcaccg 
ttgggagccc 
cactgcattt 
ttgatggaca 
ctgggagtag 
tgactgcacc 



ctcagcataa 
ttgctgatag 
tacacaagtc 
tactgtctat 
aaaggaacat 
gccccttagc 
cagctgacct 
gccccggcca 
cacctgttga 
caggtcggtt 
tggccgtggg 
tctactttgg 
aggacatcta 
aacccgatct 
tggctgcgtt 
ccaactggac 
tcactgtgga 
ggctgggcca 
acttcccacc 
tgggtacttc 
tggttggagc 
accaaggcgg 
gcaccttcag 
tggatgatga 
cctctggact 
cccttggtga 
cccaatatgt 
tgaggtccaa 
gactctccgg 
ccccactctg 
aagtggcaca 
agagacacac 
ag 



tttgggggta 
gacctaatgt 
caatactttg 
aatgattctt 
aaggacaatg 
atcttacttc 
caaccaggat 
catccacatc 
cctggacctg 
tggctcggcc 
agctccctcg 
ttccaaacaa 
ctgtaacttg 
ggtcatcggc 
ttattctggc 
ggtgagaggc 
caacagaacc 
tttgttacac 
aaacggccaa 
cctttccagt 
ccctacgtac 
agccactcgc 
cggagaccgc 
tggcttagat 
gattggggga 
catgactggc 
attgatttct 
ggcaaagaac 
ggcacttcac 
cccacctctc 
tccagtggag 
taacagccac 



tgatttcact 
tttattttat 
gacaggaaac 
ttatttcagg 
ttcataggtg 
ttgtcatttc 
gggcacggtg 
gggcgcgtgt 
gacaaggagg 
ttggctgtgt 
gtgggctccg 
ggaggaatgt 
ggctggactc 
tccccttttg 
cccagcctga 
gaggaagact 
ttgctgttgg 
atccgagatg 
agctggttta 
ggccacgtac 
gatgacgtgt 
atgtacgcac 
cgcttctccc 
gaaatcatca 
gaagacggcc 
aaatgcaaat 
cctgaagcca 
caagtcgtca 
gtctatagcc 
tcatgctgaa 
cggtggtaga 
accctctgga 



atcctaattg 
tgtttagcac 
agtagctttg 
attccatgtc 
gctctcagtt 
cttatgcgag 
acctcgtggt 
acctcatcta 
cccacaggat 
tggactttaa 
agcagctcac 
cttcttcccc 
tcttggctgc 
caccaggtgg 
gcgacaaaga 
tctcctggtt 
ttgggagccc 
agaaaaagag 
ccatttctgg 
tgatgaatgg 
ctaaggtggc 
tcatatctga 
gatttggtgg 
tggcagcccc 
gagtatatgt 
catggataac 
gctcaaggtt 
ttgctgctgg 
ttggctcaga 
tcacatccat 
tcctgataga 
aatctgatac 



4 # ti/18 # * 

Figure 7: Alignment of GPIPLD protein sequences 

database MSAFRLWPGLLIMLGSLCHRGSPCGLSTHVEIGHRALEFLQLHNGRVNYRELLLEHQDAY 60 

d3 

b2 MSAFRLWPGLLIMLGSLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDAY 60 

a 1 MS AFRLWPGLLIMLGS LCHRGS PCGLSTH I E I GHRALE FLQLHNGRVNYRE LLLE HQDAY 6 0 

database QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF 120 

d3 

b2 QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF 120 

al QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF 120 

database GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR 18 0 

d3 

b2 GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR 18 0 

al GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR 18 0 

database RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMTGEMLAVSKLYPTYSTKSPFLV 2 40 

d3 

b2 RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV 2 40 

al RWYVFVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV 24 0 

database EQFQEYFLGGLDDMAFWSTNIYHLTIFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ 300 

d3 

b2 EQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ 300 

al EQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ 300 

database KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKALERNIRTMFIGGSQL 360 

d3 MI LLFQDSMS FI YKALERNI RTMFI GGSQL 3 0 

b2 KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFI YKALERNI RTMFI GGSQL 360 

al KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFI YKALERNI RTMFI GGSQL 360 

database SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLWGAPGYSRPGHIHIGRVYLIY 42 0 

d3 SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLWGAPGYSRPGHIHIGRVYLIY 90 

b2 SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLWGAPGYSRPGHIHIGRVYLIY 42 0 

al SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLWGAPGYSRPGHIHIGRVYLIY 420 

database GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT 48 0 

d3 GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT 150 

b2 GNDLGL P PVDLDLDKEAHR I LE GFQPS GRFGSALAVLDFNVDGVPDLAVGAPS VGS EQLT 48 0 

al GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT 480 

database YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLIAADVNGDSEPDLVIGSPFAPGG 540 

d3 YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLIAADVNGDSEPDLVIGSPFAPGG 210 

b2 YKGAV YVY FGS KQGGMS S S PNI T I S CQD I YCNLGWT LLAADVNGDS E PDLVI GS PFAPGG 540 

al YKGAVY VY FGS KQGGMS S S PNI T I S CQD I YCNLGWT LLAADVNGDS E PD LV I GS PFAPGG 540 

database GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 600 

d3 GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 270 

b2 GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 600 

a 1 GKQKGI VAAFYS GPS LS DKEKLNVEAANWTVRGEED FS WFGYS LHGVTVDNRTLLLVGS P 600 

database TWKNAS RLGHLLH I RDE KKS LGRVYGY FPPNGQSWFT I SGDKAMGKLGTS LS SGHVLMNG 660 

d3 TWKNASRLGHLLH I RDE KKS LGRVYGY FPPNGQSWFT I S GDKAMGKLGTS LS SGHVLMNG 330 

b2 TWKNAS RLGH LLH I RDE KKS LGRVYGY FPPNGQSWFT I S GDKAMGKLGT S LS S GHVLMNG 660 

al TWKNASRLGHLLH I RDE KKS LGRVYGY FPPNGQSWFT I S GDKAMGKLGT S LS SGHVLMNG 660 

database TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG 720 

d3 TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALI SDAQPLLLSTFSGDRRFSRFGG 390 

b2 TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYAL I SDAQPLLLSTFSGDRRFSRFGG 720 

al TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYAL I SDAQPLLLSTFSGDRRFSRFGG 720 



6 * 12/13 # 



database VLHIiSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWIT 78 0 

d3 VLHLSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWIT 450 

b2 VLHLS D LDDD GLDE 1 1 MAAPLR I ADVTS GLI GGE D GRV YVYNGKE TT LGDMT GKCKS WI T 7 80 

al VLHLSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWIT 78 0 

database PCPEEKAQYVLISPET^JSSRFGSSLITVRSKAKNQWIAAGRSSLGARLSGALHVYSLGSD 8 4 0 

d3 PCPEEKAQYVLISPEASSRFGSSLITVRSKAKNQWIAAGRSSLGARLSGALHVYSLGSD 510 

b2 PCPEEKVSEKKKKKK 7 95 

al PCPEEKAQYVLISPEASSRFGSSLITVRSKAKNQVVIAAGRSSLGARLSGAiHVYSLGSD 84 0 



Database 840 aa 

d3 510 aa 

b2 795 aa 

al 840 aa 



* 
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Figure 8: Alignment of human GPI-PLD nucleic acid sequences 

1: pancreatic-form: cDNA sequence from GenBank database (L11702) 

2 : cDNA clone Al 

3: cDNA clone B2 

4: cDNA clone D3 



1 

61 
35 



GTGACCTGCTTAGAGAGAAGCGGTGGGTCTGCACCTGGATTTTGGAGTCCCAGTGCTGCT 60 
GT CT GCAC CT GGATTTT GGAGT CC CAGT GCT GCT 34 



ATGTCTGCT 9 

GCAGCT CT GAGCATT C CCACGT CAC CAGAGAAGC CGGTGGGCAAT GAGAGCAT GT CTGCT 120 
GCAGCT CT GAGCATT C C CAC GT CAC CAGAGAAGCCGGT GGGCAAT GAGAGCAT GT CT GCT 



10 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 
121 TT CAGGTT GT GGCCT GGC CT GCT GAT CAT GTT GGGTT CT CT CT GCCATAGAGGTT CACCG 
95 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 



7 0 T GT GGC CTTT CAACACACGTAGAAATAGGACACAGAGCT CTGGAGTTT CTT CAGCTT CAC 
181 T GTGGC CTTT CAACACACAT AGAAATAGGACACAGAGCT CT GGAGTTTCTT CAGCTTCAC 
155 T GT GGC CTTT CAACACACAT AGAAATAGGACACAGAGCT CT GGAGTTT CTT CAGCTT CAC 



94 



69 
180 
154 



129 
240 
214 



130 AAT GGGCGT GTT AACTACAGAGAGCT GTT ACTAGAACACCAGGAT GC GT AT CAGGCT GGA 
241 AATGGGC GT GTTAACT ACAGAGAGCT GTTACTAGAACACC AGGAT GCGTAT CAGGCT GGA 
215 AAT GGGC GT GTTAACT ACAGAGAGCT GTTACT AGAAC AC CAGGAT GC GT AT CAGGCT GGA 27 4 



189 
300 



190 AT CGT GTTT CCT GATT GTTTTTAC CCTAGCAT CT GCAAAGGAGGAAAATT C CAT GAT GT G 
301 AT CGT GTTT CCT GATT GTTTTTAC CCT AGCAT CT GCAAAGGAGGAAAATT CCAT GAT GT G 
275 AT CGT GTTT CCT GATT GTTTTT ACC CTAGCATCT GCAAAGGAGGAAAATT C CAT GAT GT G 334 



249 
360 



250 T CT GAGAGCACT CACT GGACT C CGTTT CTTAAT GCAAGCGTT CATTAT AT C CGAGAGAAC 309 
361 T CT GAGAGCACT CACT GGACT C C GTTT CTTAAT GCAAGC GTT CATTAT AT CC GAGAGAAC 42 0 
335 T CT GAGAGCACT CACT GGACT CC GTTTCTTAAT GCAAGCGTT CATTAT AT C CGAGAGAAC 394 



310 TATCCCCTTCCCTGGGAGAAGGACACAGAGAAACTGGTAGCTTTCTTGTTTGGAATTACT 369 
421 TATCCCCTTCCCTGGGAGAAGGACACAGAGAAACTGGTAGCTTT CTT GTTT GGAATT ACT 480 
3 95 TAT CCCCTTCCCT GGGAGAAGGACACAGAGAAACT GGTAGCT TT CTT GTTT GGAATT ACT 454 



370 T CT CACATGGC GGCAGAT GT CAGCT GGCAT AGT CT GGGC CT T GAACAAGGATT CCTT AGG 429 
4 81 T CT CACAT GGCGGCAGATGT CAGCT GGCAT AGT CT GGGC CTT GAACAAGGATT CCTTAGG 540 
541 T CTCACAT GGCGGCAGATGT CAGCT GGCAT AGT CT GGGCCTT GAACAAGGATT C CTT AGG 514 



430 ACCATGGGAGCT ATT GATTTT CAC GGCT C CTATT CAGAGGCT CATT C GGCT GGT GATTTT 
541 AC CAT GGGAGCT ATT GATTTT CAC GGCTCCT ATT CAGAGGCT CATT C GGCT GGT GATTTT 
515 AC CAT GGGAGCTATT GATT TT CAC GGCT C CTATT CAGAGGCT CATT C GGCTGGT GATTTT 



489 
600 
574 
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490 GGAGGAGAT GT GT T GAGC CAGTTT GAATTTAATTTTAATT ACCTT GCAC GAC GCT GGT AT 549 
601 GGAGGAGAT GT GTT GAGCC AGTTTGAATTTAATTTTAATTAC CTT GCAC GAC GCT GGT AT 660 
575 GGAGGAGAT GT GTT GAGC CAGTTT GAATTTAATTTTAATT ACCTTGCAC GAC GCT GGT AT 63 4 



550 GT GCCAGT CAAAGATCT ACT GGGAATTT AT GAGAAACTGTAT GGT C GAAAAGT CAT CAC C 609 
661 GT GC CAGT CAAAGAT CTACT GGGAATTT AT GAGAAACT GT AT GGT CGAAAAGT CAT C ACC 72 0 
635 GT GC CAGT CAAAGATCTACT GGGAATTT AT GAGAAACT GTAT GGT CGAAAAGT CAT CACC 694 



610 GAAAAT GTAAT CGTT GATT GTT CACAT ATCCAGTT CT TAGAAAT GTAT GGT GAGAT GCT A 669 
721 GAAAAT GTAAT C GTT GATT GTT CACAT AT C CAGTT CTT AGAAAT GTAT GGT GAGAT GCTA 7 80 
695 GAAAAT GTAAT C GTT GAT T GTT CACAT AT CCAGTT CT TAGAAAT GTAT GGT GAGAT GCTA 754 



67 0 GCTGTTTCCAAGTTATATCCCACTTACTCTACAAAGTCCCCGTTTTTGGTGGAACAATTC 72 9 
781 GCTGTTTCCAAGTTATATCCCACTTACTCTACAAAGTCCCCGTTTTTGGTGGAACAATTC 840 
755 GCTGTTTCCAAGTTATATCCCACTTACTCTACAAAGTCCCCGTTTTTGGTGGAACT^iTTC 814 



730 C AAGAGT ATTTT CTT GGAGGACT GGAT GAT AT GGCATTTT GGT C CACT AATATTT ACCAT 7 8 9 
841 CAAGAGT ATTTT CTT GGAGGACT GGAT GAT AT GGCATTTT GGT CCACTAATATTT AC CAT 900 
815 CAAGAGT AT TTT CTT GGAGGACT GGAT GAT AT GGCAT TTT GGT CCACTAAT ATTTAC CAT 87 4 
GGGCT GTAAC 1 0 



7 90 CT AACAAT CTT CAT GTT GGAGAAT GGGACCAGT GACT GCAAC CT GC CT GAGAACCCT CT G 

901 CTAACAAGCT T CAT GTT GGAGAAT GGGAC CAGT GACT GCAAC CT GCCT GAGAACC CT CT G 

87 5 CT AACAAGCTT CAT GTTGGAGAAT GGGAC CAGT GACT GCAAC CT GCCT GAGAACC CT CT G 

11 T CT GC CAT CC CT CAGCATAATTT GGGGGT AT GATTT CACTAT CCTAATT GC CTGT C CTAA 



850 TT CATT GCAT GT GGCGGC CAGCAAAAC CACAC CCAGGGCT CAAAAAT GCAGAAAAAT GAT 909 

961 TT CATT GCAT GT GGC GGC CAGCAAAAC C ACACC CAGGGCT CAAAAAT GCAGAAAAAT GAT 1020 

935 TT CATT GCAT GT GGC GGC CAGCAAAAC CACAC CCAGGGCT CAAAAAT GCAGAAAAAT GAT 994 

7 1 GTGAT CTT ACTT GCT GAT AGGACCTAAT GTT TTATTTT ATT GTT TAGCACTT CT AAAAAC 130 

910 TTT CACAGAAATTT GACT ACAT CC CTAACT GAAAGT GTTGACAGGAATATAAACTAT ACT 969 

1021 TTT CACAGAAAT TT GACT ACAT CC CTAACT GAAAGT GTT GACAGGAATATAAACTATACT 1080 

995 TTT CACAGAAATTT GACT ACATCCCTAACT GAAAGT GTTGACAGGAATATAAACTAT ACT 1054 

131 T CATTT CCTTTACACAAGT C CAAT ACTTT GGACAGGAAACAGTAGCTTT GTT GATTATGC 180 



970 GAAAGAGGAGT GT T CTTTAGT GT AAATT C CT GGAC C CC GGATT C CAT GT C CT TT AT CT AC 1029 

1081 GAAAGAGGAGTGTTCTTTAGTGTAAATTCCTGGACCCCGGATTCCATGTCCTTTATCTAC 1140 

1055 GAAAGAGGAGT GT T CTTTAGT GT AAATT CCT GGAC C C C GGATT C CAT GT C CTTTAT CT AC 1114 

181 TACGTGTCTTTACTGTCTATAATGATTCTTTTATTTCAGGATTCCATGTCCTTTATCTAC 240 



1030 AAGGCTTTGGAAAGGAACATAAGGACAATGTTCATAGGTGGCTCTCAGTT GT CACAAAAG 1089 

1141 AAGGCTTTGGAAAGGAACATAAGGACAATGTTCATAGGT GGCTCTCAGTTGT CACAAAAG 1200 

1115 AAGGCTTTGGAAAGGAACATAAGGACAATGTTCATAGGTGGCTCTCAGTTGTCACAAAAG 1174 

241 AAGGCTTT GGAAAGGAACATAAGGACAATGTT CAT AGGT GGCT CT CAGTT GT CACAAAAG 300 



109 0 CAC GTCTCCAGCCC CTT AGCATCTTACTTCTTGT CATTT C CTT AT GCGAGGCTTGGCTGG 1149 

1201 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 1260 

1175 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 1234 

301 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 360 
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1150 GCAATGACCTCAGCTGACCTCAACCAGGATGGGCACGGTGACCTCGTGGTGGGCGCACCA 1209 
^261 GCAATGACCTCAGCTGACCTCAACCAGGATGGGCACGGTGACCTCGTGGTGGGCGCACCA 1320 
1235 GCAATGACCTCAGCTGACCTCAACCAGGATGGGCACGGTGACCTCGTGGTGGGCGCACCA 
361 GCAATGACCTCAGCTGACCTCAACCAGGATGGGCACGGTGACCTCGTGGTGGGCGCACCA 



1294 
420 



1269 



1210 GGCTACAGCCGCCCCGGCCACATCCACATCGGGCGCGTGTACCTCATCTACGGCAATGAC 

1321 GGCTACAGCCGCCCCGGCCACATCCACATCGGGCGCGTGTACCTCATCTACGGCAATGAC 1380 
1295 GGCTACAGCCGCCCCGGCCACATCCACATCGGGCGCGTGTACCTCATCTACGGCAATGAC 
421 GGCTACAGCCGCCCCGGCCACATCCACATCGGGCGCGTGTACCTCATCTACGGCAATGAC 

127 0 CTGGGCCTGCCACCTGTTGACCTGGACCTGGACAAGGAGGCCCACAGGATCCTTGAAGGC 

1381 CTGGGCCTGCCACCTGTTGACCTGGACCTGGACAAGGAGGCCCACAGGATCCTTGAAGGC 1440 

1355 CTGGGCCTGCCACCTGTTGACCTGGACCTGGACAAGGAGGCCCACAGGATCCTTGAAGGC 

481 CTGGGCCTGCCACCTGTTGACCTGGACCTGGACAAGGAGGCCCACAGGATCCTTGAAGGC 



1329 



1414 
540 



1330 TTCCAGCCCTCAGGTCGGTTTGGCTCGGCCTTGGCTGTGTTGGACTTTAACGTGGACGGC 1389 

1441 TTCCAGCCCTCAGGTCGGTTTGGCTCGGCCTTGGCTGTGTTGGACTTTAACGTGGACGGC 1500 

1415 TTCCAGCCCTCAGGTCGGTTTGGCTCGGCCTTGGCTGTGTTGGACTTTAACGTGGACGGC 1474 

541 TTCCAGCCCTCAGGTCGGTTTGGCTCGGCCTTGGCTGTGTTGGACTTTAACGTGGACGGC 600 



1449 



1390 GTGCCTGACCTGGCCGTGGGAGCTCCCTCGGTGGGCTCCGAGCAGCTCACCTACAAAGGT 

llol gScSgacctggccgtgggagctcc^ J560 

GTGCCTGACCTGGCCGTGGGAGCTCCCTCGGTGGGCTCCGAGCAGCTCACCTACAAAGGT 1534 
GTGCCTGACCTGGCCGTGGGAGCTCCCTCGGTGGGCTCCGAGCAGCTCACCTACAAAGGT 



1475 
601 



660 



1450 GCCGTGTAT 



GTCTACTTTGGTTCCAAACAAGGAGGAATGTCTTCTTCCCCTAACATCACC 



1509 

1561 GCCGTGTATGTCTACTTTGGTTCCAAACAAGGAGGAATGTCTTCTTCCCCTAACATCACC 1620 
1535 GCCGTGTATGTCTACTTTGGTTCCAAACAAGGAGGAATGTCTTCTTCCCCTAACATCACC 1594 
661 GCCGTGTATGTCTACTTTGGTTCCAAACAAGGAGGAATGTCTTCTTCCCCTAACATCACC 720 

1510 ATTTCTTGCCAGGACATCTACTGTAACTTGGGCTGGACTCTCTTGGCTGCAGATGTGAAT 1569 
1621 ATTTCTTGCCAGGACATCTACTGTAACTTGGGCTGGACTCTCTTGGCTGCAGATGTGAAT "> 
1595 ATTTCTTGCCAGGACATCTACTGTAACTTGGGCTGGACTCTCTTGGCTGCAGATGTGAAT 
721 ATTTCTTGCCAGGACATCTACTGTAACTTGGGCTGGACTCTCTTGGCTGCAGATGTGAAT 



1680 
654 
780 



1570 GGAGACAGTGAACCCGATCTGGTCATCGGCTCCCCTTTTGCACCAGGTGGAGGGAAGCAG 
1681 GGAGAGAGTGAACCCGATCTGGTCATCGGCTCCCCTTTTGCACCAGGTGGAGGGAAGCAG 
1655 GGAGACAGTGAACCCGATCTGGTCATCGGCTCCCCTTTTGCACCAGGTGGAGGGAAGCAG 
781 GGAGACAGTGAACCCGATCTGGTCATCGGCTCCCCTTTTGCACCAGGTGGAGGGAAGCAG 



1629 
1740 
1714 
840 



1630 AAGGGAATTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGACAAAGAAAAACTGAAC 1689 

1741 AAGGGAATTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGACAAAGAAAAACTGAAC 1800 

1715 AAGGGAATTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGACAAAGAAAAACTGAAC 177 4 

841 AAGGGAATTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGACAAAGAAAAACTGAAC 900 



1690 GTGGAGGCAGCCAACTGGACGGTGAGAGGCGAGGAAGACTTCTCCTGGTTTGGATATTCC 1749 

1801 GTGGAGGCAGCCAACTGGACGGTGAGAGGCGAGGAAGACTTCTCCTGGTTTGGATATTCC 1860 

1775 GTGGAGGCAGCCAACTGGACGGTGAGAGGCGAGGAAGACTTCTCCTGGTTTGGATATTCC 1834 

901 GTGGAGGCAGCCAACTGGACGGTGAGAGGCGAGGAAGACTTCTCCTGGTTTGGATATTCC 960 



1750 CTTCACGGTGTCACTGTGGACAACAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 1809 

18 61 CTTCACGGTGTCACTGTGGACAACAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 1920 

1835 CTTCACGGTGTCACTGTGGACAACAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 1894 

961 CTTCACGGTGTCACTGTGGACAACAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 1020 



4 



1810 AATGCCAGCAGGCTGGGCCATTTGTTACACATCCGAGATGAGAAAAAGAGCCTTGGGAGG 18 69 

1921 AAT GC CAGC AGGCT GGGC CAT T T GTT ACACAT C C GAGAT GAGAAAAAGAGC CT T GGGAGG 1980 

1895 AAT GC C AGC AGGC T GGGC CAT T T GTT ACACAT C C GAGAT GAGAAAAAGAGC CT T GGGAGG 1954 

1021 AATGCCAGCAGGCTGGGCCATTTGTTACACATCCGAGATGAGAAAAAGAGCCTTGGGAGG 1080 



8-4- 



1870 GT GTAT GGCT ACTT CCCTVCCAAAC GGC CAAAGCTGGTTT AC CATTTCT GGAGACAAGGCA 1929 

1981 GTGTATGGCTACTTCCCACCAAACGGCCAAAGCTGGTTTACCATTTCTGGAGACAAGGCA 2040 

1955 GT GT AT GGCTACT T C CCACCAAAC GGC CAAAGCT GGTTT ACCATTT CT GGAGACAAGGCA 2014 

1081 GT GTAT GGCT ACTT CC CAC CAAACGGCCAAAGCT GGTTTAC CATTT CT GGAGACAAGGCA 1140 



1930 ATGI 



GGGAAACT GGGT ACTT CC CTTT C CAGT GGC CAC GTACT GAT GAAT GGGACT CT GAAA 



JGAAACT GGGT ACTT C CCTTT C CAGT GGCCAC GTACT GATGAAT GGGACT CT GAAA 
2015 AT GGGGAAACT GGGT ACTT C C CTTT C CAGT GGC CAC GTACT GAT GAAT GGGACT CT GAAA 
1141 AT GGGGAAACT GGGT ACTT CCCTTT C CAGT GGCCAC GTACT GAT GAAT GGGACT CT GAAA 



1989 
2100 



1990 CAAGT GCT GCT GGTT GGAGC C C CTAC GTACGAT GAC GT GT CT AAGGT GGCATT CCT GAC C 
2101 C^LAGTGCTGCTGGTTGGAGCCCCTACGTACGATGACGTGTCTJVAGGTGGCATTCCTGACC 

2075 CAAGTGCTGCTGGTTGGAGCCCCTACGTACGATGACGTGTCTAAGGTGGCATTCCTGACC 

1201 CAAGT GCT GCT GGTT GGAGC C C CTAC GTACGAT GAC GTGTCT AAGGT GGCATT C CT GACC 1260 



2049 
2160 
2134 



2050 



GT GAC CCTACACCAAGGCGGAGC CACT C GCAT GTAC GCACT CATATCT GAC GC GCAGCCT 



2109 

2161 GT GACCCTACAC CAAGGCGGAGCCACT C GCAT GTAC GCACT CATAT CT GACGCGCAGC CT 2220 

2135 GT GACC CT ACAC CAAGGCGGAGC CACT C GCAT GTAC GCACT CATATCT GAC GC GCAGC CT 2194 
1261 GT GACCCTACAC CAAGGC GGAGC CACT C GCAT GTAC GCACT CATAT CT GACGC GCAGCCT 



1320 



2110 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 2169 

2221 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 2280 

2195 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 2254 

1321 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 138 0 



2229 
2340 
2314 
1440 



2289 



217 0 T T GAGT GACCT GGAT GAT GAT GGCT T AGAT GAAAT CAT CAT GGCAGC CC CC CT GAGGATA 

2281 TT GAGT GACCT GGAT GAT GAT GGCTTAGAT GAAAT CAT CAT GGCAGCCCCC CT GAGGATA 

2255 TT GAGT GACCT GGAT GAT GAT GGCTTAGAT GAAAT CAT CAT GGCAGCCC CCCT GAGGATA 

1381 TT GAGT GACCT GGAT GAT GAT GGCTTAGAT GAAAT CATCAT GGCAGC CCCCCT GAGGATA 

2230 GCAGAT GTAAC CT CT GGACT GATT GGGGGAGAAGAC GGCCGAGT ATAT GTAT AT AAT GGC 

2341 GCAGAT GTAACCT CT GGACT GATT GGGGGAGAAGACGGC CGAGTAT AT GTATAT AATGGC 2400 

2315 GCAGAT GTAAC CTCT GGACT GATT GGGGGAGAAGAC GGCCGAGT AT AT GTAT AT AAT GGC 2374 

1441 GCAGAT GTAACCT CT GGACT GATT GGGGGAGAAGAC GGC C GAGT AT AT GTAT AT AAT GGC 1500 

2290 AAAGAGAC CAC CCTT GGT GACAT GACT GGCAAAT GCAAAT CAT GGATAACT C CAT GT CCA 2349 

2401 AAAGAGAC CAC C CTT GGT GACAT GACT GGCAAAT GCAAAT CAT GGATAACT CCAT GT C CA 2460 

2375 AAAGAGACC AC CCT T GGT GACAT GACT GGCAAAT GCAAAT CAT GGATAACT C CAT GT C CA 2434 

1501 AAAGAGAC CAC CCTT GGT GACAT GACT GGCAAAT GCAAAT CAT GGAT AACT C CAT GT C CA 1560 



2350 GAAGAAAAGGC C CAAT AT GT ATT GATTT CT CCT GAAGC C AGCT CAAGGTTT GGGAGCT CC 2409 

2461 GAAGAAAAGGCCCAATAT GT ATT GATTT CT C CT GAAGC CAGCT CAAGGTTT GGGAGCT C C 2520 

2435 GAAGAAAAGGT AAGT GAAAAAAAAAAAAAAAAAAAAAA 2 472 

1561 GAAGAAAAGGCCCAATAT GT ATT GATTT CT C CT GAAGC CAGCT CAAGGTTT GGGAGCT C C 1620 



2410 CT CAT CAC C GT GAGGT CCAAGGCAAAGAAC CAAGT CGT CATT GCT GCT GGAAGGAGTTCT 2469 
2521 CT CAT CACCGT GAGGT C CAAGGCAAAGAAC CAAGT CGT CATT GCT GCT GGAAGGAGTT CT 2580 



1621 CT CAT CAC CGT GAGGT CCAAGGCAAAGAAC CAAGT CGT CATT GCTGCT GGAAGGAGTTCT 1680 



4 



IT 1 



X741 ic^m^CTCT^CCTC^ 1800 
180! ^^^S^^ra^GCGGTGSTAGATCCTGATAGACATOG.iGCTC I860 

186 1 CTGCGACTAGAGRGACACACTRRCAGCCACACCCTCTGGAARTCTGRTACAGTAARTATA 1520 
jQ^^^ftftflAAAftAABAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



2821 TGACTGCACCAG 

1952 

1921 TGACTGCACCAGAAAAAAAAAAAAAAAAAAAA 



2770 GTTTACTTCCTCGCTCCCTTTGCATCTAAACCTTTCTTCTTTCCCAACTTATTGCCTGTA 2829 
2881 MAAAAAAAAARRRAAAAAAAAAAAAARAAAAA^--------- 



2830 GTC^GACCTGCTGTACAACCTATTTCCTCTTCCTCTTGAATGTCTTTCCAGTG^CTG^AA 2889 



2890 AGGTCCCTCTGTGGTTATCTGTTAGAACAGTCTCTGTACACAATTCCTCCTAAAAACATC 2949 



2950 CTTTTTTAAAAAAAGAATTG 1 



TTCAGCCATAAAGAAAGAACAAGATCATGCCCTTTGCAGG 3009 



3010 c^CATGGATGGAGCTGGAGGCCATTATCCTTCATAAACTATTGCAGG^^G^C^ 



3069 



3070 ACACTCCATATTCTCACTTGTAAGTGGGAGCTAAGTGAGAA^CGTG^^TAGAGGG 3129 



%5 



3130 AAACAAC AC ACACT GGGGC C TAT GAGAGGGC GGAAGGT GGGAGGAGGGAGAGAT C AGGAA 318 9 



3190 AAATAACTAAT GGATACTTAGGGT GAT GAAATAAT CT GTGT AACAAAC C C C CAT GACACA 3249 



3250 CCTTTATGTATGTAACAAACCAGCACTTCCTGCGCATGTACCCCTGAACTTAAAAGTTAA 3309 



3310 AAAAAAGT T GAAC T T AAAAAT AAC AGAT T GGC C CAT GC CAAT C AAAGT AT AAT AGAAAGC 3369 





3370 ATAGTATAC 3378 



